SEQ ID NO: 1 BLM gene cluster ORFS 3 0 through 8 

(note orf 31-40 on sequence 1-18660 (are translated on the reverse strand and on a 
separate file) 

18601 ACCCATCTCATAGGTGTACGCGCTGGAGtATTCGGGGCACGACGGAAGGTTCTCGGTCAC 18660 

18661 GAGAGCACTGTAAGCCCGAACCCGCAAGGATGACGAATTGCAAAATTGTGCAAGTCGCTA 18720 

18721 CATGATGGTCCGGCTGTGCCCGCAGGTAC^CCGCGGGCACAGCACCAGACGCTGCCTCCGC 187 80 

18781 GCACCGCGCGGGAGGCCCGGTGAGGCGAG^GGCTGAGGTTCCGTGCCGGTTCCGCTGTAT 1884 0 

M P V P L Y (orf30) 

18841 CAGGCGAAGGCCGAGTTCTTCCGGATGCTGbGGCACCCGGTCCGCATCCGCGTACTGGAG 18900 
QAKAEFFRMLGHPVRI RVLE 

18901 CTGCTGCAGGACGGGCCGATGCCGGTGCGTGATCTGCTGGCGGCGATCGAGATCGAGCCC 1896 0 
LLQDGPMPVROLLAAIE I EP 

18 961 TCGGCGCTGTCCCAGCAGCTGGCGGTGTTGCeCCGCTCGGGCATCGTGACCTCCACCCGC 190 20 
SALSQQLAVL Rl RSGIVTSTR 

19021 ACGGGTTCCACGGTCGTCTACGAGCTGGCCGCTTGGCGACGTGGCGGAGCTGATGTCCGCC 19080 
TGSTVVYELAGlGDVAELMSA 

19081 GCGCGCCGCATCCTGACCGAGATGCTCAATGGGCAGCACGAGCTGCTGGAGGAGCTGAGG 19140 
ARRI LTEMLNGlQHELLEELR 

19141 GAAGCCGAGGTCAGTGCCCGGTGAGCTCCCTCqpCGTCCGGGTGGGAGCCCGGGTGCGTT 192 0 0 
EAEVSAR * 

MS S LAlVRV .GARVRS (orf29) 

192 01 ccgtgctgcccacccgcgccgacctcgcgggca r flgggccgcagcccgcgacgtgatctac 192 6 0 
vlptradlagmIgrsprrdll 

19261 tggccggtctgaccgtggcgatcgtggccctgccsctcgccctcggattcggcgtctcct 19320 
agltvaivalpllalgfgvss 

19321 ccggtctcggcgcggaggcagggctggccaccgcggtggtggcgggcgcgctggccgcgg 193 80 
glgaeaglatavvagalaav 

193 81 tattcggtgggtcgaatctccaggtgtccgggccclcgggcgccatgaccgtggtcctgg 1944 0 
fggsnlqvsg pt^gamtvvlv 

( 19441 tgcccatcgtcgcccggtacggccccggcggtgtcctcacggtcggcctgctcgccggac 19500 
pivarygpggv l^t v g l l a g l 

tgatgctgatcgcgctcgccctcgcccgcgccggccggtacatgcagtacgtgccggccc 19560 
mlialalaragr\ymqyvpap 




^ M 



cggtggtggagggcttcaccctcggcatcgcctgcgtgatcggcttgcagcaggtgccga 1962 0 
vvegftlgiacviglqqvpn 

19621 acgccctgggagtcgccaagccggagggcgacaaggtcctcgtcgtgacctggcgcgcgg 19680 
algvakpegdkvl\vvtwrav 

19681 tcgagaccttcgccggggcgcccaactggaccgctgccggactggcggcagcggtcgccg 19740 
etfagapnwtaag\laaavaa 

19741 cggtcatgctgaccggcgcgcggtggcggccggtcgttccgttctccctcctcgcggtga 198 00 
vmltgarwrpvvpVsllavt 

. \ 

19801 ccggtgccaccgtcgtggcccagctgtgccacctggacgcggcccgcccgatcggggacc 19860 
gatvvaqlchlda^rpigdl 

19861 tgcccgcggggctgcccgccccgtcgctggccttcctggacctcggagcactgggctcgc 19920 
paglpapslafldlIgalgsl 



D L\ 



19921 tgctggcgcctgccgtggccgtggcggcccttgccgcgttggaiatcgctgctgtcggcgt 19980 
lapavavaalaale\sllsas 



19981 CCGTCGCGGACGGGATGACGGTCGGGCAGAAGCACGACCCGGACAAGGAGCTGTTCGGGC 2004 0 
VADGMTVGQKHDPDKELFGQ 

20041 AGGGTCTCGCCAACCTGGCCGCCCCGCTGTTCGGCGGCGTCCCGGCCACCGGCGCGATAG 20100 
GLANLAAPLFGGVPATGAIA 

20101 CCCGCACCGCCGTCAACGTCCGTACCGGTGCGAGCTCGCGACTGGCGGCCCTCACGCACG 2 0160 
RTAVNVRTGAS SRLAALTHA 

20161 CCGCGATCCTCGCCGTCATCGTCTTCGCCGCCGCCCCACTGGTCTCCCGCATCCCCCTGG 2 02 20 
AILAVIVFAAAPLVSRI PLA 

2 0221 CCGCGCTCGCCGGCGTGCTGATCGCGACCGCGATCCGCATGGTCGAAGTGGGCAGCCTGC 2 02 80 
ALAGVLIATAIRMVEVGSLR 

202 81 GGGCGATGGCCCGCGCCACGCGCTCCGACGGCCTGGTACTGATCCTCACGGCGGTCGCCA 2 034 0 

AMARATRSDGLVLI LTAVAT 

203 41 CCGTGGCCCTGGACCTCGTCTACGCCGTCATCATCGGCCTGCTGGTCGCCGGCGCACTCG 2 04 00 

VALDLVYAVI IGLLVAGALA 

204 01 CCCTGCGGGC CGTGGCCAAGCAGGTCCGCCTGGACCAGGTCTCCTTGAAGGAGGACCTGA 2 04 6 0 

LRAVAKQVRLDQVSLKEDLT 

204 61 CCGGCGACCACAGCGCCGAGGAACACGCGCTGCTCGCCGAGCACATCGTGGCGTACCGCA 2052 0 
GDHSAEEHALLAEHIVAYRI 

20521 TCGACGGTCCGCTGTTCTTCGCCGCGGCCCACCGCTTCCTGCTGGAACTCTCGGACGTCG 205 80 
DGPLFFAAAHRFLLELSDVA 

20581 CGGACGTGCGCGTGGTGATCCTGCGCATGTCCCGCGTGACCACCATGGACGCCACCGGCG 2 064 0 
DVRVVI LRMSRVTTMDATGA 

20641 CCCTCGTCCTGAAGGACGCGGTCACCAAGCTGAACCGGCGCGGCATCACCGTCCTGGCCT 2 07 00 
LVLKDAVTKLNRRG I TVLAS 

20701 CCGGGGTACGCCCCGGCCAGCGCCGGGTCCTCGACTCCGTCGGCGCCCTCGGTCTGCTCC 20760 
GVRP-GQRRVLDSVGALGLLR 

2 0761 GGGCCGCCACCGGCGACGACTACACCGGCACTCCCGAAGCCATCGCCGCCGCCCGAAGCC 20820 
AATGDDYTGTPEAIAAARSH 

2 0821 ACCTGCACGGCGCCGGTGTCCTGGCCCCCGCCTGCCCGGGCCCGCCTCCTCCGGTACCCC 2 08 80 
LHGAGVLAPACPGPPPPVPP 

2 08 81 CACCGTGCGCTCCGAGTGCCCGACGATGAGGAGCCGACCGAGGTCCTCCTCCGTCACCCG 20 940 
PCAPSARR* 

2 0 941 GACACCCACGGTTGCGCCGCCCCATGCCGGCGGTCCCTCCTGACGGCCCGTCCGCGGCTT 210 0 0 

210 01 G AGG CGG CGGTGG ACGG C CTG C CG C CG C CGG CCTCGGG CTG AT CGG CGTGAT CAC CGC C C 21060 

21061 ATGCGCGGGTGGGCGCCCGCGGCATCGTGGGCGGGACCGTGTTCCCGGCCACCGCGGCGG 2112 0 

21121 CCGGCCTCGCGCTGGGCGTGGCCTGCCGCGGTGCCTGGTAGCGGCGGGGTCCGGCGGCCG 21180 

21181 GGCCTGTGCTTCTTCCCGCCCGTCCGGCGGGTGGCGCCGCGCCGGCGGTGACAGGGAAAT 2124 0 

21241 ATGACCGGAACTGGGATGCTCGCGTCCACTCGGGTGTGTTTAAGTGCCACGGGGGCTTCC 213 00 

213 01 GACGGCGCGTCGCGCGCCGGCGGTTCGCCCGATGATGGTCGTGCGGCGCTGTGAGCCGGG 213 6 0 

21361 GAGCCTATGGCACAGGACCTGAACGACTGGATCGAGGACGAGGTCGTCCCTTACGAGGAG 214 2 0 

MAQDLNDWIEDEVVPYEE (orf28) 

214 21 AAGCCTCTCGAATGGATCTCCCAGTACCACTTCTTCCGCGACCCGGCGCGAGCCGCCTAT 214 80 

KPLEWI SQYHFFRDPARAAY 

214 81 GTCGATCACACCTACTTCTTCTCACCGGCCGATGGCGCGATCGTCTACCAGAAAGTAGTG 2154 0 
VDHTYFFSPADGAIVYQKVV 

21541 GATCCCCAGGAGTCGATCATCGACATCAAGGGGAAGCCGTACTCGCTGGCCGCCGCGCTC 216 00 
DPQESI ID I KGKPYSLAAAL 



2 



21601 CGTGACGAATCGTTCGGTCACCGGTGCCTGGTGATCGGCATCTTCATGACCTTCTTCGAC 21660 
RDES FGHRCLVIGI FMTFFD 

21661 GTGCACATCAACCGGATGCCTTACGGCGGCCGTCTCTCCTTCGCGCTCAAGGAGCCCATC 21720 
VHINRMPYGGRLSFALKEPI 

21721 GGGACGTTCAACCTCCCCATGCTGGCCATGGAGCAGGACCTGCTCGAACGGCTCCGGGTC 21780 
GTFNLPMLAMEQDLLERLRV 

21781 AATCCGGCTCACGCGAGGTATCTGCACCTGAACGAGCGGATGGTCAACCGGGTCGACGCG 2184 0 
NPAHARYLHLNERMVNRVDA 

21841 CCGCGGCTCCGGGGCCCGTACTGGATGCTCCAGATCGCCGACTACGACGTCGACTCCATC 21900 
PRLRGPYWMLQIADYDVDS I 

21901 ACCCCGTTCTGCAGACGGCAGGGAATGTTCCGCTCCCAGGGGCGCCGCTTCTCCCAGATC 2196 0 
TPFCRRQGMFRSQGRRFSQI 

21961 CGCTACGGATCGCAGGTCGACCTGGTGATCCCGATGGCGGCCGACCGCGAGTACGTCCCC 22 0 2 0 
RYGSQVDLVI PMAADREYVP 

22 021 GTGGAGGCCGTCGGCCGGCACGTGAAGGCGGGGCTCGACCCGCTCGTCAAGATCCGGTGG 22 08 0 
VEAVGRHVKAGLDPLVKIRW 

22 081 CGTTGAAGAGCGCGTACGAAGCGATGGCGAACTGGAGGGACACAGCGTGGGTTTCCGTCG 2214 0 

R * M G F R R (orf27) 

y 22141 AGCGCAGAGGGCCGGTGGGCCGGGAGCGGGCCGGCGGGAGAGCGCCCGGTTCAGGCCGGA 222 00 

£ AQRAGGPGAGRRESARFRPD 

1 2 22 01 CGGGCCGTCGGCGCCGCGGGACCGTCCGTTACCCCTGTCCGCCGGGCAGTTGTTCGAGTG 22260 
^ GPSAPRDRPLPLSAGQLFEW 
3 ...... 

fj 2 2261 GGTGTTTGACAAGCTCGTCGACGGAGATCTGAGCCACCAGCCGACGATTGTGCGGCTCCG 223 2 0 
I* VFDKLVDGDLSHQPTIVRLR 

22 321 CGGCCCGCTGAACACCGCCGCCCTGCGGATGGCCTACGCCCGGCTGGTGCGGCGCCACGA 223 8 0 
GPLNTAALRMAYARLVRRHE 

22381 GTGCCTGCGCACCCGCTTCCCCGTGATCGACGGGGAGCCCGTGCAGGTGATCGAGGGCAT 224 4 0 
CLRTRFPVI DGEPVQVI EGI 

22441 CGGGAAAGCAGCGGGGGGCCCGCTGCCGCTCATCGATCTGCGCCACCTCCCGGAGGCGCT 2250 0 
GKAAGGPLPLIDLRHLPEAL 

2 2501 TCGCGCGCGCGAGATCGCGAGGATCCGCGAGGAGACGCTGTCCACGCCGGTCCCCTTCGA 22 56 0 
RAREIAR IREETLSTPVPFD 

22561 CAAGCGGCCGCCCGTCCGCGTGGCGCTGATCCGGGCGGCGCC CGAGGAGCACCTCTTCCT 2262 0 
KRPPVRVALIRAAPEEHLFL 

2 2621 CGTCGGCATCCCGCACATCACCGCGGACCTGTGGTCCGCGACCCTGCTCAACGACGAGCT 2268 0 
VGIPHITADLWSATLLNDEL 

22681 CATGG CG C ACT AC AGGG CGGGGG C CG AGGGG ACT C C C T C C CGGG CC C C CACC C C CGT CG C 22740 
MAHYRAGAEGTPSRAPTPVA 

22741 GC AGTACGCCGACTTCGCGCAGTGGCAGCGCGCGTGGTGGAACCGGGACCGCACCGAGCG 228 00 
QYADFAQWQRAWWNRDRTER 

22801 GGAGGCCGGACGGTGGCGGGCGCGGCTGGACGGGCTGTCCGCCGTGGAACTGCCCCTGGA 2 2 860 
EAGRWRARLDGLSAVELPLD 

22 861 CCGGCCCCGCCCCGCGGGCCGCCGGCGGGACTGCTTCCTGATCGGGGACACCTTCGACGC 22 92 0 
RPRPAGRRRDCFLIGDTFDA 

22921 CGAACTGAGCGACCGGCTGCGCGCCTTGGCACGCACCGCCGACGTCACGCTGTACGTGGT 2298 0 
ELSDRLRALARTADVTLYVV 

22981 G CTG CTGG CGG CGTT C C ACTGGCTGGTGGGGCGG ATGT CGGG CGC CGG C CGG CTGGTG AC 23 04 0 
LLAAFHWLVGRMSGAGRLVT 

2 3041 CACCT CG CTCGTGG CCG C CCGGCACGG CAGCGCGGTAC AGGGG ATG AC CGG C C CGTTCT C 23100 
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it*? 



TSLVAARHGSAVQGMTGPFS 

23101 GGACTACCTGGCCCTGGTCGGGGACCTGTCGGGCGATCCGGACTTCCTGGAGTCCCTGCG 2 3160 
DYLALVGDLSGDPDFLESLR 

23161 CCGCGTACGCGACGAGTGCCTGACCGCCCACGACCACCAGCGGCTTCCGTTCTCACAGGT 2 322 0 
RVRDECLTAHDHQR LPFSQV 

23221 CCTCGAAGTCATGGACCCCGGACGCGAGTTGCACCCCCATCCGCTGGAGCAGCTCGGGTT 2 3280 
LEVMDPGRELHPHPLEQLGF 

23281 CAACCTCCACAACATCCCTCCCGCGGTCATGGACTTCTCCGGCGACGTCGTCGTCTCGGC 2 334 0 
NLHNI PPAVMDFSGDVVVSA 

23341 GGTGAACCCGGAGGGGGACGACGGGGAGAGCGGCGACGGGGAGTACGTGCCCTGGACCGC 2 34 00 
VNPEGDDGESGDGEYVPWTA 

234 01 CGACCTGACCTTCGACGTCTACGACTACGGCACCGGCCATATGCCGTTCGACGTGATACT 2 34 6 0 
DLTFDVYDYGTGHMPFDVIL 

23461 CGACCGGCGGCTGGCCGATCCGGCGACGGCCCGGGAGTGGGCCGGGCACTACCGGTCGGT 2 3 52 0 
DRRLADPATAREWAGHYRSV 

23521 GCTCCGTGCGGTCGTCGCCGACCCCGGCGTGCGCCTGTCCGCCCTCGGCACCCTGCTGTC 2 3 58 0 
LRAVVADPGVRLSALGTLLS 

23581 CCTGCCGCGACCGCCGTCCGCCACGTCCTTCGGCGGCCGGGAGATCGACGTCCGGCGCGT 2 3 640 
LPRPPSATSFGGREIDVRRV 



23 641 CGAACGCGAGTTGGCGGGGCGCGACGGGATCACCGCCGCCCTGGTCGCGGTGGCGCCCCG 2 3 700 

ERELAGRDG I TAALVAVAPR 

^4 ...... 

, » A J 23701 GCGCCTGGCCACCGGGCTGCGCGTACGGGAACTGGTCGCCTACTGCGCCGTCGAGGGCAC 2 3 760 
2 RLATGLRVRELVAYCAVEGT 

ufl 23761 GCCGCGTCCGAACGCGGCCCACGACATCCGCGGCCGCCTGCGGGAGCGCCTGCCCGACGG 2 382 0 
>|j PRPNAAHDI RGRLRERLPDG 

. ^ 23 821 CTGGGTGCCGACCGTGTTCGTCGAGCGCCCGCCGGAGGAGATCCGGAAGGCCCTGGCCGC 2388 0 
=P WVPTVFVERPPEEI RKALAA 

...... 

2 3 881 CCGGGCGGCGGGCGGCGAACGGGCGGAGCCGCTGCCGCCGCCCGAGGACTGCGTCCCGCT 2 3 940 
RAAGGERAEPLPPPEDCVPL 

' □ 23 941 TCCCGAGGAGGGCCGGCCCCCCTCGGACCCGTCCGAGCGGCGGCTGGCCGCGCTCTGGGC 24000 
IS, PEEGRPPSDPSERRLAALWA 

24001 CGAGATCCTGGGCGCCCCGCCGAAGAGCGTGACCGAGCCCTTCTTCCGCGTCGGCGTCAC 24 060 
E I LGAPPKSVTEPFFRVGVT 

24 061 CGATAAGGACGCCCTCCGCTTCCTGGCCCGCGTGGCGGAGGACTTCGGCGTCACCGTGCC 2412 0 

DKDALRFLARVAEDFGVTVP 

24121 CTTCGCCGACTTCCTCAGCGCTCCCAACCTGCGTATGGTGAAGGACAATTTGGCTGAGAA 24180 
FAD FLSAPNLRMVKDNLAE K 

24181 ACGGAGGGTGTAACGCGCAATGAGTGAGTGGTAGGGTCGGAATCGAACCGCACTGATCGG 24 240 
R R V * 

24 241 CAATCTTTTCGGTCAGCTGTTCCGGATATTCCGGGGCGCGTCGGCGCTCCCTCGACCAAG 24 3 00 

24301 GGCGTACGCGGATAAGCGTGCGCCGCCCCACGGCTGCGTCTCGACGCCTTCATCGGCGCG 24 3 60 

24361 TCGGACACTTCGCGGTGCCAGTCGGCACGCTCAGAGATCAGTGGAATGCCTCGGTGTGCC 2442 0 

M P R C A (orf26) 

244 21 CGAGGTGCGCTCAGTACTGCTGTCCACACAACGCGCCAAGGGAGTTGGAACGTGATGGAG 244 80 
RGALSTAVHTTRQGSWNVME 

24481 ACGGCGAATTCCGGCTATCGGGTCTCACCTCAGCAGCGGCATTTATGGGCCATGCTGACC 24 54 0 
TANSGYRVS PQQRHLWAMLT 

24 541 CGCGGGCGGGACGGCGGGCGACGTGCGTTCACCCAGTCCGCCGTGGTGGTCGACCGTTCC 24 600 
RGRDGGRRAFTQSAVVVDRS 
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24 601 CTGGACGCCGCACGTCTGCGCGCCGCGCTGGCCTCCGTGGTGGCCGCCCACGAGCCGCTG 24 66 0 
LDAARLRAALASVVAAHEPL 

24 661 CGGACGACCTTCACCGGTCTCGCGGGACGGACCGCGCCGGTCCAGGTCGTCCATGACCCG 24 720 
RTTFTGLAGRTAPVQVVHDP 

24 721 GACGAGCAGCCGCTGTCCGTCGTCGACCTGCCGC CCTCGTGCGCCGACGGCTCGGGCCCG 24780 
DEQPLSVVDLPPSCADGSG'P 

24 781 GAACTGGACGAGCTCCGGCTCCGCGAACGCGCCGCCCTCGACCCGCGCGGCGGGCCCGTC 2484 0 
ELDELRLRERAALDPRGGPV 

24 841 TTCCGGGCCGCCCTGGCGCGGGCCGGCGAGGACCGGGCGGTGCTGGTGCTCACCGCGCAC 24 900 
FRAALARAGEDRAVLVLTAH 

24 901 GCCCTGGTCGCGGACCGGCTCTCCCTCCGGCTGCTGGCCGGGCAGATCCTCGCGGCGTAC 24 960 
ALVADRLS LRLLAGQ I LAAY 

24 961 AGCGGGGAGACCGTGTCCCCCGATGGCCCGCCGCCCTTGCAGTACGCCGACTTCGCCGCC 2 5 02 0 
SGETVS.PDG P P P LQYAD FAA 

2 5 021 TGGCACCACGACCTGCTCACCGCCGAGGACGCCGCCCCCGACCGCGCGCACTGGGCCGCC 2 5 080 
WHHDLLTAEDAAPDRAHWAA 

25081 CACACCGCCACCGCCGGCACCGGGCCGCTCCCCGGCGTCGTACGGCCCGGCGCCGCCCCG 2514 0 
HTATAGTGPLPGVVRPGAAP 

rs 1i 

i-Xi- ...... 

t Q 25141 GGTCCGTGGCGGGCGCGGGAGTGGGAACTGCCCGCCGAACTGGTGGCGGGGATCGACGGC 2 5200 

% GPWRAREWELPAELVAGIDG 

H 252 01 GTCGCCGGGAAGCTGTCCACCGATCCCGCCACCGTGCTGCACGCCGCCTTCCGTATCGCG 2 5260 

*J VAGKLSTDPATVLHAAFRIA 

;!j 2 5 261 GTCTGGCGGCTCGCCGGCGAGCGGAACCTGCCCGTCGCCCTCACTCGTGACGGCCGTTCC 2 532 0 

!N VWRLAGERNLPVALTRDGRS 

2 5321 CACCCCGAACTCCGCACCGCGATCGGCGCCTTCGAGCGTGAGCTCCCGCTCGTCCACGAG 2 53 80 
^ HPELRTAI GAFERELPLVHE 

^ 253 81 ATCCGTCACGAGACGGCGTTCGCGGAATACGCGCGCGCTCTGGACGCGCTCGTCGCCGAG 2 5440 

IRHETAFAEYARALDALVAE 

, 0 2 54 41 GGCGAGGAACTCCTCGACCATTGCGACCCGGAACTGCTCGGCAGCCTCGACGGCACCGCG 2 5500 

P GEELLDHCDPELLGSLDGTA 

2 5501 GAAGGGCCCTGCTTCACCTTCACCCACCACCAGGCCGAAACACCGGTCCGGCGGGCCGGC 2 5 560 
EGPCFTFTHHQAETPVRRAG 

25561 ATCACCTTTACCACCGTCCATCAGGATTCGGGTACGCCGATTCCCGTCCGCCTGACCGCC 2 5 62 0 
ITFTTVHQDSGTPI PVRLTA 

25621 CGACG CG ACGG CG C C CGG CTG CG C ATGG AACTGGG AT ACGACGAGGGC CGTATCG ACG AG 2 5 68 0 
RRDGARLRMELGYDEGRIDE 

25681 ACGTTTCCCGAGAACGCCGCCGCCTGCCTCACCCGCATTCTCGAAGGCGTCGTCTCCGCC 2 574 0 
TFPENAAACLTRILEGVVSA 

25741 CCCGAGGGCCCGGTCGGCGACATCCGCATGCTGTCGGACGAGACCGCACGGCTGCTCCGG 25800 
PEGPVGDI RMLSDETARLLR 

25801 GAAGCGGGGCTGGGCCCCCGCGTGGAACTTCCCGGCAAGGCGGTCCACGAACTCTTCGCC 2 5860 
EAGLGPRVELPGKAVHELFA 

25861 GAGCAGGCCGCGCGCACCCCCGGGGCGGTCGCGGTCAGCGCGGGCGAGGACGCCCTCACG 25920 
EQAARTPGAVAVSAGEDALT 

2 5 921 TACGCCGAACTCGACGAGCGGTCCAACCGCCTGGCACACCACCTGACCGGGCTCGGGGTG 25980 
YAELDERSNRLAHHLTGLGV 

25981 ACACCCGGCCGGCACGTCGTGGTCTCGGTCGGCCGCTCCGCCGAGCTGCTCGTCGGGCTG 26040 
TPGRHVVVSVGRSAELLVGL 

26 041 CTCGGCGTGCTCAAGGCGGGTGGCGCCTTCGTCCCCGTCGACGTGGGCTTCCCCCGCAAA 26100 
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LGVLKAGGAFVPVDVGFPRK 

2 6101 CGGCTGGAGTTCGTGCTCCGGGAGACCGCCGCGCCGGTCCTGCTCTGCACCGCCGACGTA 2 6160 
RLEFVLRETAAPVLLCTADV 

2 6161 CGGGACCGCATCGGCACTCGGACCCTCGACGACGCCGGGGTGACACCCGTCGCGCTGGAC 262 20 
RDRI GTRTLDDAGVTPVALD 

2 6221 GCCGACCGGCGGCGCATCGCCGCACACCCCGCCGGCCCCACCGGCATCGCCACCACCCCC 262 80 
ADRRRIAAHPAGPTGIATTP 

2 6281 GACGCCCCCGCGTACGTCGTCTACACCTCCGGCACCACCGGGAAGCCCAACGGCGTACGC 2 634 0 
DAPAYVVYTSGTTGKPNGVR 

2 6341 GTCCCGCACCGGGGCCTCACCAACTACCTCACCTGGTGCACCGGCGCCTACGGACTCGAC 2 6400 
VPHRGLTNYLTWCTGAYGLD 

2 64 01 GGGGGCACCGGCACCCTCGTGCACACCTCCATCAGCTTCGACCTCACCCTCACCACCCTG 264 60 
GGTGTLVHTS 1 SFDLTLTTL 

2 6461 TTCGGCCCCCTGCTCGCCGGCGGGCAGGTGGTCATGCTCTCCGAGACCGCCGGCGTGACC 26520 
FGPLLAGGQVVMLSETAGVT 

2 6521 GGCCTGATCGCCGCGCTGCGCTCCCGGCGCGACCTCACCCTGGTCAAGCTGACCCCGACC 265 80 
GLIAALRSRRDLTLVKLTPT 

^ 26581 CACCTCGACGTCGTCAACCAGCTGCTCACCCCCGACGAGCTGCGCGGCGCGGTCCGCACC 2664 0 

! ''r; HLDVVNQLLTPDELRGAVRT 

ip 2 6641 CTCGTCGTCGGCGGGGAGGCGGTGCGGGCGGAGAGCCTGGAGCCGTTCCGGGCCTCCGGG 267 00 

LVVGGEAVRAES L E PFRASG 
g ...... 

'2 2 6701 ACGCGGGTCGTCAACGAGTACGGGCCCAGCGAGACGGTCGTCGGCAGCGTCGCGCACGTC 26760 

f»y TRVVNEYG PSETVVGSVAHV 

2 67 61 GTCGACGCCGCCACGCCCCGTACCGGCCCGGTGCCCATCGGCCGGCCGATCGCCAACACC 2682 0 
VDAATPRTGPVPIGRP IANT 

Q 2 6821 ACCGTCCACCTGCTCGACCAGCGGCGGCGGCCCGTCCCCGACGGCGTCGTCGGCGAGCTG 268 80 

TVHLLDQRRRPVPDGVVGEL 

Q 26881 TGGATCGGCGGCGCCGGTGTCGCCGACGGCTACCTGGGGCGGCCGGAACTCACCGGCGAG 2694 0 

i s jff WIGGAGVADGYLGRPELTGE 

; J 2 6 941 CGCTTCCTCCCCAGCGACTACCCGCCGGACGGCGGCCGGGTCTACCGCACCGGCGACCTG 27000 

U RFLPSDYPPDGGRVYRTGDL 

2 7 001 GCCCGCCGGCGCGCCGACGGCACCCTGGAGTACCTCGGGCGCACCGACGCGCAGGTGAAG 270 60 
ARRRADGTLEYLGRTDAQVK 

2 7061 ATCCGCGGCGTCCGGGTGGAGCCCGCCGAGACCGAGGCCGTCCTCGCCTCCCACCCCGGC 27120 
IRGVRVEPAETEAVLASHPG 

27121 GTCGGCCAGGCCGTCGTGGTCGCCCGGCTGGACGAGGACCCCGGCCGTTCGTCGCCGCTC 27180 
VGQAVVVARLDEDPGRSS PL 

27181 GCCGGCGAGCTGACGCTGACCGGCTACGTGGTCCCGGCCCGCGGTGCCCAGGCGCCCCCG 27240 
AGELTLTGYVVPARGAQAPP 

2 7241 CACGAGGAGCTCATCGCGTACTGCCGGGAGCGGCTGCCCGAGCACTTCGTCCCGGCCGTC 273 00 
HEELIAYCRERLPEHFVPAV 

2 73 01 CTCGTCACCCTCGACGCCCTGCCCGTCACCGGCCACGGCAAGATCGACCGCGGTGCGCTG 2 73 60 
LVTLDALPVTGHGKI DRGAL 

2 7361 CCCAAGCCGCACGCCCGGGCCCGGGACGGCGCGGCGTACGTCGCGCCGCGCACCGCCACC 2 7420 
PKPHARARDGAAYVAPRTAT 

2 7421 GAGGAGATCCTCGCGGCCACCGTCGCGAAGGTGCTGGGCGTCGAGCGCGTCGGCATCGAC 27480 
EE I LAATVAKVLGVERVGID 

27481 GAC AAC T ACTT CGTC CTGGGCGG CG AC TCCAT C CG C AG CGTC ATGGT CG C CAG C CGGG C C 27540 
DNYFVLGGDS IRSVMVASRA 
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% % 

2 7541 CAGGCCCGCGGGGTCGAGGTCACCGTGGCCGACCTGCACCGGCACCCCACCGTCCGGGCC 27600 
QARGVEVTVA DLHRHPTVRA 

2 7601 TGCGCCGCGCACCTGGACGCCCGCGAGGACCTGCCGCGGACGCCCGTCACCGAACCCTTC 27660 
CAAHLDAREDLPRTPVTEPF 

2 7661 GCGCTGATCTCCGCCGAGGACCGGGCGCTGGTGCCGGACGACGTCGAGGACGCCTTCCCG 2772 0 
ALISAEDRALVPDDVEDAFP 

2 7721 CTGAACCTGCTCCAGGAAGGCATGATCTTCCACCGCGACTTCGCGGCGAAGTCGGCCGTC 27780 
LNLLQEGMI FHRDFAAKSAV 

2 7781 TACCACGCCATCGCGTCCGTGCGGCTGCGCGCCCCGTTCGACCTCGCCGTGCTGCGGATG 27 84 0 
YHAIASVRLRAPFDLAVLRM 

2 7 841 GTCGTGCGCCAGCTCGTCGAGCGGCACCCGATGCTGCGCACCTCCTTCGACATGAGCCGC 27900 
VVRQLVERHPMLRTSFDMSR 

2 7901 TTCAGCCGCCCGCTGCAACTGGTGCACCGCGAGTTCGCCGATCCGCTGCACTACGAGGAC 2 7960 
FSRPLQLVHREFADPLHYED 

2 7961 CTGCGCGGCAGGAGCGCCGAGGAGCAGGACGCCCGCGTCGAGGAGTGGATCGAGCGGGAG 2 802 0 
LRGRSAEEQDARVEEWIERE 

2 8021 AAGGAACGCGGCTTCGAGCTGCACGAGTTCCCGCTGATCCGCTTCATGGCGCAGCGCCTG 28080 
KERGFELHEFPLIRFMAQ RL 

2 8081 GAGGACGACGTCTTCCAGTTCACCTACGGCTTCC ACCACGAGATCGTGGACGGCTGGAGC 2814 0 
EDDVFQFTYGFHHEIVDGWS 

2 8141 GAAGCCCTGATGATCACCGAGCTGTTCAGCCACTACTTCTCGGTGATCTACGACGAGCCG 282 00 
EALMITELFSHYFSVIYDEP 

2 82 01 ATCGCGATCAAGCCACCCACCGCCGGCATGCGCGACGCCGTCGCCCTGGAGCTGGAGGCC 28260 
IAI KPPTAGMRDAVALELEA 

2 8261 CTCGCGGACCGCCGCAACTACGAGTTCTGGGACTCCTACCTCGCCGACGCCACCCTGATG 2832 0 
LADRRNYEFWDSYLADATLM 

2 8321 CGGCTGCCCAGGCCCGGCACCGGACCCCGGGCCGACAAGGGCGACCGGGACATCACCCGC 28380 
RLPRPGTGPRADKGDRDITR 

2 83 81 ATCGCCGTCCCCGTCCCCACCGAACTCTCCGACGGCCTCAAGCGGGTCGCCGCCACCCAC 28440 
IAVPVPTELSDGLKRVAATH 

2 8441 GCCGTCCCGCTGAAGACCGTGCTCCTGGCCGCGCACATGGTGGTGATGTCCCTCTACGGC 28500 
AVPLKTVLLAAHMVVMSLYG 

2 8501 GGCCACGAGGACACCCTCACCTACACCGTCACCAACGGCCGCCCCGAGACCGCCGACGGC 28560 
GHEDTLTYTVTNGRPETADG 

2 8561 AGCACCGCGATCGGGCTGTTCGTCAACAGCCTCGCGCTCCGCGTCCGGATGACCGGCGGC 2 862 0 
STAIGLFVNSLALRVRMTGG 

2 8621 ACCTGGGCCGACCTGATCACCGCCACGCTGGAGTCCGAGCGCGCCTCGATGCCGTACCGG 2 8680 
TWADLITATLESERASMPYR 

2 8681 CGGCTGCCGATGGCCGAACTCAAGCGCCACCAGGGCAACGAACCCCTGGCCGAGACGCTG 2874 0 
RLPMAELKRHQGNEPLAETL 

2 8741 TTCTTCTTCACCAACTACCACGTCTTCCACGTGCTCGACCGCTGGATCGACCGCGGCGTC 2 8800 
FFFTNYHVFHVLDRWIDRGV 

2 8801 GGCCACGTCGCCAACGAGCTCTACGGCGAGTCCACCTTCCCCTTCTGCGGCATCTTCCGC 2886 0 
GHVANELYGESTFPFCGI FR 

2 8861 CTGAACCGGGAGACCGGCGAGCTGGAGGTCCGCATCGAGTACGACAGCCTGCAGTTCTCC 28920 
LNRETGELEVRI EYDSLQFS 

2 8921 GACGCCCTCATGGAGAGCGTCCGCGACAGCTACGCCCGCGTCCTCGCGGCCCTGGTCGCC 28980 
DALMESVRDSYARVLAALVA 

28981 GACCCCGACGGGCGCTACGACCGGCACGAGTTCCGCTCCGACCGCGACCGGGCCGCACTG 2 904 0 
DPDGRYDRHEFRSDRDRAAL 
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2 9041 GCCGTCCTCACCCGCGGGCCCGAGGCGCCGGCGGCCGACCGGTGCCTGCACGACCTGGTG 29100 
AVLTRGPEAPAADRCLHDLV 

2 9101 GCGGACCGGGCGGCGGACCGCCCCGACGCCCCGGCCGTCCAGCTGGACACCGACGTGCTC 2 9160 
ADRAADRPDAPAVQLDTDVL 

2 9161 AGCTACGGCGAGCTCGACCGCCGCGCCAACCGGCTGGCCCACCACCTGCGTTCGCTCGGC 2 922 0 
SYGELDRRANRLAHHLRSLG 

2 9221 ATCGGCCCGGAGAGCGTCGTCGGCGTCCTGGCCGAACGCTCCCTCGCCCAGATCATCGGC 2 92 8 0 
IGPESVVGVLAERSLAQI IG 

2 9281 CTCCTCGCGGTCCTCAAGGCGGGCGCCGCCTACGTCCCGCTCGACCCGGCCCAGCCCGAC 2 9340 
LLAVLKAGAAYVPLD PAQPD 

2 9341 GAGCGCCTCGCCGCCGTCATCGCCGGGAGCGGGGCCGCCGCCGTCCTCCACCGGCCCGGC 2 9400 
ERLAAVIAGSGAAAVLHRPG 

2 9401 CTCGAAGGGCGGCTGCCCGCGGGCGTCCGCGCGCTCCCCACCGACGCCGCCGACGGCAGC 29460 
LEGRLPAGVRALPTDAADGS 

2 9461 ACCGCCACGCACGACCCCGGGCCCACCGCCACGCCCCGCAACGCCGCGTACGTGATGTAC 29520 
TATHDPGPTATPRNAAYVMY 

29521 ACCTCCGGATCCACCGGAGAGCCCAAGGGCATCGTCGTCGAACACCGCAACGTCGTGGCC 2 9580 
TSGSTGEPKGIVVEHRNVVA 

2 9581 TCCCTCGCCGCCCGCGGCGCCCACTACGCGGCCGGACCCGGCCGGTTCCTGCTGCTGTCC 2 964 0 
SLAARGAHYAAGPGRFLLLS 

2 9641 TCCTTCGCCTTCGACAGCTCGGTCGCCGGCATCTTCTGGACGCTGACCCAGGGCGGCACC 2 97 0 0 
SFAFDSSVAGI FWTLTQGGT 

2 9701 CTCGTCCTGCCCGGCGAGGGACAGCAACTCGACCCCGCCGCGCTGGTGGAGACCATCGCC 2 97 6 0 
LVLPGEGQQLDPAALVETIA 

2 9761 CGGCAACGGCCCACCCACACCCTCGCCATCCCCTCCCTGCTGGCGCCCGTCCTGGACCAG 2 9820 
RQRPTHTLAI PSLLAPVLDQ 

29821 GCCGCCCCCGGCGACCTCGCCTCCCTGCGCACGGTGATCGCCGCGGGCGAGTCCTGTCCG 2 98 80 
AAPGDLASLRTVIAAGESCP 

2 9881 GCCGAACTGGCCGCCGCCTGCCGGGACCTGCTGCCCGGGAGCACCTTCCACAACGAGTAC 2 994 0 
AELAAACRDLLPGSTFHNEY 

2 9941 GGCCCCACCGAGACCACCGTGTGGAGCACCGTCTGGTCCCAGGAGAACGAGCACGACGGA 30000 

GPTETTVWSTVWSQENEHDG 

30001 CCCCACCTCCCCATCGGCCGGCCGGTCGCGGGCACCTGGGTGCACCCCCGCGACCACCGC 3 0060 
PHLPIGRPVAGTWVHPRDHR 

3 0061 GGACGCACCGTCCCCCTCGGCGTCGCCGGCGAACTCTCCATCGGCGGCGCCGGCGTGGCC 3 0120 

GRTVPLGVAGELS IGGAGVA 

30121 CGCGGCTACCTCGGGCGCCCCCGGGACACCGCGGCCGCCTTCCGCCCCGACCCCGAGGCC 3 018 0 
RGYLGRPRDTAAAFRPDPEA 

30181 ACGGCTCCCGGCGGCCGCGCCTACGCCACCGGCGACCTCGGCCGCTACCTCCCCGACGGC 3 024 0 
TAPGGRAYATGDLGRYLPDG 

30241 AACCTGGAGTTCCTCGGCCGCGCCGACCACCAGGTCAAGATCCGCGGCTTCCGGGTCGAG 3 03 00 
NLEFLGRADHQVKI RGFRVE 

303 01 CTCGGCGAGATCGAGGCCGTCCTCGACACCCACCCGGAGCTCCAGCGGACCATCGTCATG 3 03 60 
LGEIEAVLDTHPELQRTIVM 

30361 GCACGCGGCGACCACCCCGGCGACCAGGTGCTCGTCGCCTACGTCCTCCCCGCCCCCGGC 3 0420 
ARGDH PGDQVLVAYVLPAPG 

3 0421 CGGCGGCCCGAACCCGCCGACATCCAGGGGTACGTCCGCGACCGGCTGCCCCGCTACATG 30480 
RRPEPADIQGYVRDRLPRYM 

3 04 81 GTGCCCACCGCGGTGATCGTCCTCGACGCGGTACCGCTGACCGCCGCCGGCAAGGTCGAC 3 054 0 
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VPTAV IVLDAVPLTAAGKVD 

3 0541 CGGGCCTCGCTCCCCGCCCCCAGCCACGCCCAGCTCACCCGGGACCAGGAGTACGTCGAG 30600 
RASLPAPSHAQLTRDQEYVE 

3 0601 CCCGGCACCGACACCGAGCGGGCGCTCGCCGCCATCTGGGCCGACGTCCTCAAACTGGAC 30660 
PGTDTERALAAIWADVLKLD 

3 0661 CGGATCGGGGCCGGTGACCGCTTCTTCGACGTCGGCGGCGAATCCCTGCGCGCGATGCAG 3 072 0 
RIGAGDRFFDVGGES LRAMQ 

30721 GCCACCGCCGCGGCCAACAAGATGTTCCGCACCCGCGTCTCCGTCCGCCGCCTCTTCGAG 3 0780 
ATAAANKMFRTRVSVRRLFE 

3 0781 GCGCCCTCCCTGCGGGAGTTCGCCCACGAGATCGACAAGGCCCGCCTCGCGGGCGGCGGG 3 0 84 0 
APSLREFAHEI DKARLAGGG 

3 0 841 ACCGGCCTCACCGGCCCCGCGGCCGCCCCGGCCACCGGAGGTGCCGCCGAATGACCCCGG 30900 
TGLTG PAAAPATGGAAE * 

M T P A (orf25) 

3 0901 CCGCCGACACCACCCACCCGCTCTCGCCGGCCCAGCGCAGCATGTGGTTCCTGCACCGGC 3 0 960 
ADTTHPLS PAQRSMWFLHRL 

3 0961 TCGCGCCCGAGGTGCCCGCCTACAACATCTGCACCGCCATCGAGCTCACCGGCACACCGC 3102 0 
APEVPAYNICTAIELTGTPR 

31021 GCCCGGCGGCGCTGCGGGACGTGGTACGGCGGCTCGGCCGCAGGCACGAGGCGCTGCGCA 31080 
PAALRDVVRRLGRRHEALRT 

31081 CGGTGTTCCCGTCGGTGGGGGAGACCCCCCGCCAACGGGTCACCGACCGGGCGGCGCCCC 3114 0 
VFPSVGETPRQRVTDRAAPL 

31141 TGCGGACCGTGGACCTCACCCACCTGACCCCCGCCGCCGCCGAGGCCGAGACCGCACGGA 312 00 
RTVDLTHLTPAAAEAETART 

312 01 CGCTACGGTGCGCCGCCGCCCGGCCGTTCCGGCTCGACACCGGCCCCCTGGCGGAATGGA 312 60 
LRCAAARPFRLDTGPLAEWT 

312 61 CCCTGCTGCGCCGCGCCCCCGGCCACGCGCTGCTCGTCCTCTCCGTCCACCACATCGTCT 31320 
LLRRAPGHALLVLSVHHIVF 

31321 TCGACGGCGGCTCGCTCCACGTGGTCTGCCGCGAACTGGAGGAGGCGTACGGAGCGGCCC 313 80 
DGGS LHVVCRELEEAYGAAL 



O 313 81 TCGCCGGGCGCCCGGACCCCCTCGGCACACCCGCGCCGGGCTACGGACGGCAGTGCCGGA 3144 0 

AGRPDPLGTPAPGYGRQCRT 

31441 CGCGGGCGGCGGAACAGGACGAGGCCGGGCGGGAGTTCTGGCGCCGCGAACTGTCCGGCG 31500 
RAAEQDEAGREFWRRELSGA 

31501 CGCCACCCCGCACGACCGTCTTCCGGGGCACCGGCCGGCCCGCCGGACCGCCCGCCCGCG 31560 
PPRTTVFRGTGRPAGPPARA 

31561 CCACCGTCCACTACGGCACCGACGATCCGGCCCCGACCGCGGACTTCTGCCGCGAGCACG 31620 
TVHYGTDDPAPTADFCREHA 

31621 CCGTCACCGGCTACGTGCTGCTGCTCGCGGCCCTCGCCTGCCTGGTCGCCCGGTACACCG 31680 
VTGYVLLLAALACLVARYTG 

31681 GCCGGACGGACGTGGTGATCGGCTCACCCGTCGGACTGCGCGAGGACCCCGAAGGGCTCG 31740 
RTDVVIGS PVGLREDPEGLA 

31741 CCACCGTCGGCCCGATGCTCAACCTGCTGCCGCTGCGCCTCCGGCTGCACGGCGACCCCG 31800 
TVGPMLNLLPLRLRLHGDPG 

31801 GCTTCGGCGAGGTCCTGGCCCGCACCCGGGAGACGCTGCTCGGCGCGCTGGAGCACCGCA 31860 
FGEVLARTRETLLGALEHRT 

31861 CCACACCGTTCGAGGACATCGTCGACGCGGTGGGCGCCGACCGGGACCCGGACGTCAGCC 3192 0 
TPFEDIVDAVGADRDPDVSP 

31921 CCCTCTTCCAGATCCTCTTCGCCCACGAACGCCCCCCGGCCCCACCCGCGTTACCGGGCG 31980 
LFQILFAHERPPAPPALPGV 
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31981 TCCGTGCCCGCGTCGTACCCGTCCCCGCTCCGGCCGCCAAGTACGAGCTCGCCGTCACCG 32 04 0 
RARVVPVPAPAAKYELAVTA 

32 041 CCACCGAGACGCCCGACGGGCTCCGGCTGATCGTCGAGGCGGAGCACGGACACGGGGAAC 3210 0 

TETPDGLRL IVEAEHGHGEP 

3 2101 CGGCCGAACTCGCCGCCTTCGCCCGCCACTTCGGCGTCCTGCTGGCCGCCGGGGTCCGCG 32160 
AELAAFARH FGVLLAAGVRA 

3 2161 CGCCGGACACACCGCTGAGCCGCCTGCCGCTGCTCACCGACGAGGAGCGGCGCCGGCTCA 3 2 22 0 
P.DTPLSRLPLLTDEERRRLT 

3 2221 CCGACACCACGGCCCCCCGCACCGCGCCGGAGGCCCCCTACCGCCCCCTGCACCGGCTGG 322 8 0 
DTTAPRTAPEAPYRPLHRLV 

3 2281 TCGAGGAGTCCGCCGCCCGCCGGCCCGACGCCCTGGCGGTCGTCGGCGGCACGCGTCACC 3234 0 
EESAARRPDALAVVGGTRHL 

32341 TCAGCTACCGGGAGCTGAACTGCCGCGCCAACCGGCGTGCCGCCTGGCTGCGCCGCGCTG 324 00 
SYRELNCRANRRAAWLRRAG 

324 01 GCATCGGCACCGAGGACGTGGTCGGCGTCCGGCTGGAACGCGGCCCGGAACTCCTCGTCT 3 24 60 
IGTEDVVGVRLERGPELLVS 

3 2461 CGCTCCTCGCCGTCCTCAAGGCCGGCGCCGCCTACCTGCCCGTCGACCCGGCGCTGCCCG 32520 
LLAVLKAGAAYLPVDPALPA 

3 2521 CCGAGCGGGTACGGCTGATGCTCGACGACGCCCGGGCCGCGCTGCTGCTCACCGAGACCG 325 80 
ERVRLMLDDARAALLLTETA 

3 2581 CGCTCGGCACCCCGCCGGCCCCGGCCGGCACCCCCGTGCACCACGTGGACGGACCGCCAC 32 64 0 
LGTPPAPAGTPVHHVDGPPP 

3 2 641 CGCCGACCCGGCCCGGGGACGACGCCGACCACACCGGCCCCGACCTGCCCACCAGCCTCG 32 7 00 
PTRPGDDADHTGPDLPTS LA 

327 01 CCTACCTCCTCTACACCTCCGGGTCGACGGGCCGGCCCAAGGCCGTGGCCCTCCAGCACG 32 760 
YLLYTSGSTGRPKAVALQHD 

327 61 ACAGCGCCGCGGCGTTCCTGCGCTGGGCGGGCCGCGCCTTCGACGGCGGGGAGCTGGCCG 32 820 
SAAAFLRWAGRAFDGGE L A A 

32821 CCGTCCTGGCCACCACCTCCGCCGGCTTCGACCTGTCGGTCTTCGAGCTGTTCGCCCCCC 32 8 80 
VLATTSAGFDLSVFELFAPL 

3 2881 TGGCCCACGGCGGCACCGTCGTCCTCGCCGACAGCGCCCTGCACGTGCCCGCCCTGCCCT 32 94 0 
AHGGTVVLADSALHVPALPW 

3 2 941 GGGCGCCCGCGGCGACGCTCCTGAACACCGTGCCCTCCGCGGCCGCCGCCCTGCTGGACG 33000 
APAATLLNTVPSAAAALLDA 

33001 CCGACGGCCTGCCCGACGGTCTGACGGCCGTCAACCTGGCGGGCGAGCCCCTGACCGCGG 33 06 0 
DGLPDGLTAVNLAGEPLTAE 

33 061 AGCTGGTCGCCCGGCTGCACGCCCGCCTGCCGAAGGCCGCCGTCCGCAACCTCTACGGCC 3312 0 

LVARLHARLPKAAVRNLYGP 

33121 CCTCGGAGGCCACCACCTACGCCACCGCGGCCCTCGTGCCCGCGGGCGGCACCGAGGCGC 33180 
SEATTYATAALVPAGGTEAP 

3 3181 CGGCCATCGGCCGGGCGCTCGGCGCGGCCCGCGTGTGGACCGCCGACGACCGGCAGCGCC 3324 0 
AIGRALGAARVWTADDRQRP 

33241 CCCTCCCCGGCGCGGTCGTCGGTGAACTCCTCATCGGCGGTACGGCCCCGGCCCGCGGCT 3 33 00 
L PGAVVGE LL I GGTAPARGY 

333 01 ACCTCGGCCGGCCGGGACCGACCGCCGACGCCTTCCGGCCCGATCCGACGGGACCGCCCG 3 336 0 

LGRPGPTADAFRPDPTGPPG 

33361 GCTCCCGGCTCTACCGCACCGGGGACCTGGCCGTACGCCGCCCCGACGGCCGGTTCGTGT 3 34 20 
SRLYRTGDLAVRRPDGRFVF 

334 21 TCCTCGGCCGCAAGGACGAGCAGATCAAACTCCGCGGGGTGCGCATCGAACCGGGCGAGG 334 80 



10 




LGRKDEQI KLRGVRI EPGEV 

334 81 TGGAAGCCGCTCTCCGCCAGTGCGCGCCGGTCGCCGCGGCCGCCGTCGTGCTCGCCGGGA 33 540 
EAALRQCAPVAAAAVVLAGT 

3 3541 CCACCGCGGAGAACCACCGCCTCGTCGGCTTCGTCACCCCTTCGCCCGGCGCCCGCGTCG 33600 
TAENHRLVGFVTPSPGARVD 

33 601 ACCCCGAGCGCACCCTCGCCGCGCTGCGTTCGCGCCTGCCCGCCGCCCTCGTGCCCGCCG 33 660 
PERTLAALRSRLPAALVPAA 

33 661 CGCTGGTGGTGTGCGACGCCCTGCCGCTGACCGCCAACGGGAAGACCGACCGGGCCGCCC 3372 0 
LVVCDAL PLTANGKTDRAAL 

3 3 721 TCGCCCGGCGGGCGCGCGGACACCGGCCGGACCACGGCGCGTACGCCCCGCCCCGCACCC 3 3 78 0 
ARRARGHRPDHGAYAPPRTR 

33781 GCGTCGAGAAGGCGGTCGCCGCGATCTGGCGCGAGGTGCTCGGGACCGAACGGGTGGGGA 3 3 84 0 
VEKAVAAI WREVLGTERVGI 

3 3841 TCCACCAGGGGTTCTTCGACGCGGGCGGCACCTCCCTGTCGCTGCTGCGCCTTCACCACC 33900 
HQGFFDAGGTSLSLLRLHHR 

33 901 GGCTGGTCGCGTCCGTCCATCCCGGCCTCCGGCTCGCCGACGTCTTCCGGCTGCCGACCG 33 96 0 

LVASVH PGLRLADVFRLPTV 

3 3 961 TCGCCGCGCTCGCCGCGTTCGTGGACGGGCAGGAGGACGCGCGCGAGACGGCCGTCGGCG 34 02 0 
hJ AALAAFVDGQEDARETAVGD 

3 4 0 2 1 ACGCGGCCCTCCGGGCCGGCCGGCGCCGCGCCGCGGTGGCCGCGCGCCGCAGGAAAGGCG 34 08 0 
AALRAGRRRAAVAARRRKGG 

*4 34081 GCGGACGATGAGCCATGCCGACGCGGGCGACGGGCTCGACGCGGCTGACACGACTGACGC 3414 0 

..g MSHADAGDGLDAADTTDA (orf24) 

g r * 

■a?. a 

\ U 34141 GGCCGACGGGATCGCCGTGATCTCGCTGGGCGGACGCTTCCCCGGAGCGGACCGGGTGGA 342 00 
iS ADGIAVI SLGGRFPGADRVD 

342 01 CCGCCTCTGGACGAACCTGCTCGACCGCGAGGACGCCATCAGCCACTTCACCGCCGACGA 342 60 
^ RLWTNLLDREDAI SHFTADE 

34 2 61 ACGCCTCGCCCGGGGCCGCGACCCCGAACTGGTGCGCCACCCGCGGTTCGTCGGCGCGGA 34 32 0 
: fj RLARGRD PELVRHPRFVGAE 

i==J ...... 

Q 34 321 AGGCGTCCTCGGCGACGTCTCCCTCTTCGACGCCGAGTTCTTCGGCTGCTCGCCGCGCGA 34 38 0 
GVLGDVSLFDAEFFGCS PRE 

34 3 81 GGCCGAAGTCATGGACCCGCAGCACCGGCTCTGCCTGGAGGAGGCGTGGCACGTCTTCGA 3444 0 
AEVMDPQHRLCLEEAWHVFD 

344 41 CACCGCCGGCTACGACCCGGCGGCGACGGGCACCGCGGTCGGGGTGTTCCTCTCCGCGAG 3450 0 

TAGYDPAATGTAVGVFLSAS 

345 01 CCTCAGCTCGTACCTGATCCGCAACGTCCTGCCCGGCGGCGCGGCACAGCGCCTGCTCGG 34 56 0 

LSSYLIRNVLPGGAAQRLLG 

34561 CGGCTTCCCGCTGCTGATCCACAACGACAAGGACTTTCTGGCCACCACCGTGTCCCACAA 34 62 0 
GFPLLI HNDKDFLATTVSHK 

34 621 ACTGGGCCTCACCGGGCCGAGTTACGCCGTCGGCTCGGCCTGCTCGTCCTCCCTCGTCGC 34 68 0 
LGLTGPSYAVGSACSSSLVA 

34681 GGTGCACCTGGCCTGCCAGAGCCTGCTCACCGAGGAATGCGACATGGCGCTGGCCGGCGG 34 74 0 
VHLACQSLLTEECDMALAGG 

34741 GGTCTCGCTCCAAGTGCCGCAGGGCCAGGGGTACGTGCACGCCGACGACGGCATCTACTC 34 800 
VSLQVPQGQGYVHADDGIYS 

34 8 01 AC C CGACGGG CG CTG CG C C C CCTT CGACG C CGG CG CGG CGGG CACGGTGGG CGG CAG CGG 34 860 
PDGRCAPFDAGAAGTVGGSG 

34 861 CGTGGG C CTCGT CCTGCTCAAG CGG CTCGC CGACG CCGTGCGCGACGGGGACCGCGT CCA 34 92 0 
VGLVLLKRLADAVRDGDRVH 
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34921 CGCGGTGATCCTCGGCTCGGCGGTGAACAACGACGGCGCCGACAAGGTCGGTTACACGGC 34 980 
AVI LGSAVNNDGADKVGYTA 

34 981 GCCCGGCGTCACCGGCCAGAGCGCCGTCGTCGCCGAGGCCCTGGCGGTGGCCGGGATCTC 3 5040 
PGVTGQSAVVAEALAVAG I S 

35041 CG C CG CG ACCGTCGG CGT C CTGGAGG CG CACGG CAC CGG C AC C CGGCTGGG CG AT C C CGT 3 5100 
AATVGVLEAHGTGTRLGDPV 

35101 CGAAGTGGCCGCGCT CAC C CGGG CGTT C CG CG C C CACACGGACCG C AG CGG CTTCTGCGC 3 5160 
EVAALTRAFRAHTDRSGFCA 

35161 GCTGGGCTCGGTGAAGGCCAACGTGGGCCACCTGGACGCGGCGGCGGGCGTCACCGGGCT 3 5 22 0 
LGSVKANVGHLDAAAGVTGL 

3 5221 GATCAAGGCCGTGCTGGCGGTCCGCGAGGGCGTCATCCCCGGCACCCCGCACTACCGTTC 3 52 80 
I KAVLAVREGVI PGTPHYRS 

3 52 81 GCCCAACCCCGCCATCGACTTCGCCACCACACCCTTCTACGTCACCGCCGACACCCTCGC 3 534 0 
PNPAI DFATTPFYVTADTLA 

3 5 341 CTGGCCGGAGGCGGACCACCCCCGCCGGGCCGGCGTCAGCTCCTTCGGCATCGGGGGCAC 3 5400 
WPEADHPRRAGVSSFGIGGT 

3 54 01 CAACGCCCACGTGATCCTGGAACAGGCCCCGCCGGCCGCCCCCCGCGCGGACCGGACCGC 3 54 60 
a ». NAHVI LEQAPPAAPRADRTA 

S 3 5461 CGGGGTGCCCATGCCGTTGGTGGTGTCCGCCCGCACCCGCGAAGCACTGGCGGAGGCCGT 3 552 0 

: ,g GVPMPLVV.SARTREALAEAV 

.1 35521 CCGGGACCTGGCGGCGTGGTCGGCCCCGGAGCCGGGGACCCGGCTCGCCGATCTCGCCGC 35580 

RDLAAWSAPEPGTRLADLAA 

m 35581 CACGCTGGCCGGGCGCCGGGCCTTCCCGTACCGCGCCGCCGTCGTGTGCCACGACCTGCC 3 564 0 

TLAGRRAF PYRAAVVCHDL P 

iU 

s 35 641 CGAGGCCGCGCGCCTGCTGGGCGGCGCGCGCGGCGAGACCGCGCTCCCCGGCAGGGAGGC 35700 

EAARLLGGARGETALPGREA 

^ 3 57 01 CGTGTTCCTCTTCCCCGGGCAGGGCACCCTCCCGCCGGACACCGGGCGCGGCCTGTACGC 3 5760 

C3 VFLFPGQGTLPPDTGRGLYA 

^ 3 5 761 GGACGTGCCGGCGTTCCGCGCCCACTTCGACGCCTGTGCCGAAGGGTTCGCCCCGCTCGG 3 5820 

y DVPAFRAHFDACAEGFAPLG 

3 5821 CACCGACCTCCACGCCGCGCTCGGGGCCCCGGCCGACGACACCAGGGCCGCGCAACCCGC 3 5880 
TDLHAALGAPADDTRAAQPA 

3 5881 CCTCTTCGCCGTCGAGTACGCCCTCGCCCGCACCCTGATGGACTGGGGTGTGCGCCCGGC 3 5 940 
LFAVEYALARTLMDWGVRPA 

3 5941 CGCGATGCTCGGGCACAGCCTCGGCGAGTACGTCGCGGCGACGCTGGCCGGGGTGCTGTC 36000 
AMLGHS LGEYVAATLAGVLS 

3 6 001 CCTGCCGGACGCGCTGACGCTCGTCCGGGCCCGGGCGGAAGCGCAGCACACCATGCCGCC 3.6 0 6 0 
LPDALTLVRARAEAQHTMPP 

3 6061 CGGCCGCATGCTCGCGGTCCCGCTCACGCCGGACGACCTGCGCCCGCTGCTGCCCCCGGA 3 6120 
GRMLAVPLTPDDLRPLLPPE 

36121 GGTGGAGTTCAGCGCCTTCAACGCCCCCGGCCGCTGCGTCGTCGGCGGGCCCCCGGAGCC 36180 
VEFSAFNAPGRCVVGGPPEP 

36181 GGTGGCGGAGCTGCGCGCCCGGCTGGCGCGGCGCGGAGTGCCGGCCGCCGAACTGGCCAC 3 624 0 
VAELRARLARRGVPAAELAT 

36241 CGCGCACGCCTTCCACTCGGCGGCCGTCGAACCGCTGCTGGACGGCTTCCGGGGCGTGCT 3 63 00 
AHAFHSAAVEPLLDGFRGVL 

363 01 GGAAGGCGTCCGACTGCGGCCGCCCCGGCTGCGGTACGTGTCCTCCCTCACCGGCGACTG 3 63 60 
EGVRLRPPRLRYVSSLTGDW 

36361 GGCCGACGCCGCGGTCACCACCCCCGCGTACTGGCTCGCCCACCTGCGCCGGCCCGTCCG 3 6420 
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ADAAVTTPAYWLAHLRRPVR 

3 6421 CTTCGCCGACGGCCTGCGGCGCTGCCTGGACCTCGGCCCCGTCGCCCTGGTCGAGACCGG 36480 
FADGLRRCLDLGPVALVETG 

3 6481 GCCGCGGGCCGGACTGACCGGCCTGGCCCGCCGCGCCGCGGGCCCCGGCGAGCCCCCTTA 36540 
PRAGLTGLARRAAGPGEPPY 

3 6541 CACCGTCCGCTGCCTGGCCGCCCCCGACGAGGCGGCTTCGCTGACCCACGCGGTCGCCGT 3 66 00 
TVRCLAAPDEAASLTHAVAV 

3 6 601 ACTCTGGCGCTCGGGCTGCGCCGTCGACTGGACGGCGTTCCACCGCCCCGGGCGCCCCCG 3 6 66 0 
LWRSGCAVDWTAFHRPGRPR 

36661 C CG C AC C AC CGTGCCCGG CT AC C CCTT C C AACGGG T ACGG C ACTGG AT CGACG CG C CGG A 3 6720 
RTTVPGYPFQRVRHWI DAPD 

3 6721 CGAGTCCGAACCCACGGACCTCGCCACCGCCCTGCGCGCGGAGTTGCGGACGGACGGCGA 36780 
ESEPTDLATALRAELRTDGD 

36781 TCCGCCGCTCGCCGTCGATCAGCGGCCCGGACTGCGCACGGGGCTGAACCGGCTGTGCGC 3684 0 
PPLAVDQRPGLRTGLNRLCA 

36841 CGCCCTGGCCCGCGACTACCTGGCCACCGGCGTCGAAGCGAGCGGGGTCCTGCCCGGATT 3 6 900 
ALARDYLATGVEASGVLPGF 

36901 CCACCGCTTCCTGGACTACCTGCGCACCCTGGCCGCCTCCGCACCGGCCGCGGACGACGC 3 6960 
HRFLDYLRTLAASAPA. ADDA 

GGGGACGATCGCCGCGGAGATCACCGCGGCCCACCCGTCCTTCTCCGGGCTCGTCGACCT 3 70 20 
GTIAAEITAAHPSFSGLVDL 

GCTCCGGCACTGCGCCCAGGGCTATCCGCGCGCCCTGTCCACCCCCGGAGCCGCACTGGA 3 708 0 
LRHCAQGYPRALSTPGAALD 

^ ; 3 7081 CGTCCTCTATCCGGCCGGCAGCGGCGACCTCCTGCGCCGCACCCTGGGCGAGGGCACCGC 3 714 0 

TO VLYPAGSGDLLRRTLGEGTA 

CGACCACCGCGCCACCGGCCGCCTCACCCGGCTGGCCGGCTCCCTGCTCGACCGGCTCGC 3 72 00 
DHRATGRLTRLAGSLLDRLA 

GGCCGACCGCGAACCCGGCCGCCCGCTGCGCGTCCTGGAGGCCGGAGCGGGCGCGGGCAG 372 60 
ADRE PGRPLRVLEAGAGAGS 

CCTCACCCAGGCCCTGGTCACCCGGGCCCCCGGCCGGCTCGACTACCACGCCACCGACAT 3 732 0 
LTQALVTRAPGRLDYHATDI 

37321 CTCCCGGCACTTCGTGACCGCACTCGGCCGGGAGGCCGCCCGGCGCGGCCTGGACTTCGT 3 73 80 
SRHFVTALGREAARRGLDFV 

3 73 81 CCGCGCACGCGTCCTCGACATCGCCCGCGACCCAGGCGAACAGGGCTTCGCCGGCGAGCG 3744 0 
RARVLDIARDPGEQGFAGER 

3 7441 GTTCGACGTCGTCTGCGGCCTCGACGTGGTCCACGCCACCCCCGACCTGCGCACCACGCT 3750 0 
FDVVCGLDVVHATPDLRTTL 

37 501 CGGCCATCTGCGCTCCCTGATGGCACCGGACGGCACCCTCGCGCTGATCGAGACCACCGC 3 7560 
GHLRSLMAPDGTLALI ETTA 

3 7561 CGACGACCCCTGGCTGACGATGATCTGGGGCCTGACGGACGGCTGGTGGCACCACACCGA 3 7 62 0 
DDPWLTMIWGLTDGWWHHTD 

37621 CCGGCGCACCCACGGCCCGCTGCTCGACGCCGCCGGCTGGCGCGCCCTCCTGGCCGGCGA 37680 
RRTHGPLLDAAGWRALLAGE 

3 7 681 GGACTTCGCCACGGCCGATGTGATCGTGCCGCCCGACGGCCCCCAGGACGCGGCCCTGCT 3 774 0 
DFATADVIVP PDGPQDAALL 

37741 GCTCGCCCGGCAGACCCCCCGGCCGGCGGCGGCCGCACCGTCCGTCGGCAAGCGGGACGT 37800 
LARQTPRPAAAAPSVGKRDV 

37801 CGGCACGTGGTGCTACGCCCGCGGCTGGCGGCACGCCGCGCCCGCCGACCCCGCCCCGCT 3 7860 
GTWCYARGWRHAAPADPAPL 
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37021 



a 37141 
C 

KJ 37201 

Q 37261 





3 7861 GACGGGCGGCTGCCTGCTGCTGGGCGACGGGGACACGGCGAAGGCCGTCGCGAGCCGGCT 37 92 0 
TGGCLLLGDGDTAKAVASRL 

3 7 921 GGAGGCCCTCGGCGTGCCCGTCACCACCGTCGGCGGCGGCCGACCGCCGGGCCCCGAGCG 37 98 0 
EALGVPVTTVGGGRPPGPER 

3 7 981 GTACCGGGAACTCGTCGGCCCCGCCACCCGCCTGGCCGTCGACCTGTGGCCGCTGCGCGA 38 04 0 
YRELVGPATRLAVDLWPLRD 

38041 CGCGTCCCACCGCGGCCGCGCCGCCGGCGCCGCCGGCGTACGGACCGCCCAGGACGCCGC 38100 
ASHRGRAAGAAGVRTAQDAA 

3 8101 GCTGCACAACCTGCTCCACCTCGCCCGGGCCTTCGGCGCGCTGGAGGAGCGCCACCCCGC 3 8160 
LHNLLHLARAFGALEERHPA 

3 8161 CCGCGTCGTGACCGTGACCACCGGTGCCCACGACGTGCTCGGCGACGACCTCGCCCACCC 3 8220 
RVVTVTTGAHDVLGDDLAHP 

3 8221 CGAGCACGCCACCGTCCCGGCCGCGGCCAAGGTGATCCCCCGGGAGTACCCGTGGATCGC 3 8280 
EHATVPAAAKVI PREYPW IA 

3 82 81 CTGCACCGCCCTGGACGTGGAGCCGGGCCTGGACGCCGAGCGGCTGGCGGACCTGATCGT 3 834 0 
C TALDVE PGLDAERLAD L I V 

3 8341 CCGGGAACTCGGCGCGGCGCGCGAGACCACCGTCACCGCCTGCCGCGGCCGACGCCGCTT 3 84 00 
RELGAARETTVTACRGRRRF 

3 8401 CACCCCCTGCCCCGTCCGGCAGCCCCTCCCCGCCGCACCGGAACGCCCGGCGGTCCGGCC 38460 
TPCPVRQPLPAAPERPAVRP 

3 84 61 CGGCGGCGTCTACCTCGTCTGCGGCGGCCTCGGCGGCATCGGCCTCCACCTCGCCGAGTA 38520 
GGVYLVCGGLGG IGLHLAEY 

3 8521 CCTGGGCCGCGCCCGCACCACCGTCGTCCTCACCCACCGGCGGCCCTTTCCCGCCCCCGG 38580 
LGRARTTVVLTHRRPF PAPG 

3 8581 CGCGTGGGACGGGCTGCCCGCGGGACACCCGGAGGCGGCCGTCGTCCGGCGGCTGCGCTC 3 8640 
AWDGL PAGHPEAAVVRRLRS 

3 8641 CCTCGCCGCCACCGGCGCCACGGTCGTCGTCCGCCGGGCCGACCTCACCGACCACGACGC 38700 
LAATGATVVVRRADLTDHDA 

3 87 01 GATGCGCGCCCTCGCGGACGAGGTGGAACAGGCCCACGGCCCCGTCCGGGGGGTGGTGCA 38760 
MRALADEVEQAHGPVRGVVH 

3 8761 CGCGGCCGGGGTGCCCGACACCGCCGGCATGATCCAGCGTCGCGACCGAGCCGGCACGGA 38820 
AAGVPDTAGMI QRRDRAGTD 

3 8821 CGCCGCCCTCGCCGCCAAACTGACCGGCACCCTCGTCCTGGACGAGGTGTTCGCCCACCG 388 80 
AALAAKLTGTLVLDEVFAHR 

3 8881 CGACCTCGACTTCCTCGTCCTGTGCTCCTCGATCGGCACCGTGCTGCACAAGCTGAAGTT 3 8940 
DLDFLVLCSSIGTVLHKLKF 

38941 CGGCGAGGTCGGCTACGTGGCGGGCAACGAGTTCCTCGACGCCTATGCCGCCCACCGCGC 3 900 0 
GEVGYVAGNEFLDAYAAHRA 

3 9001 GGCCCGCCGCCCCGGCAGAACCCTGTCGATCGCCTGGACCGACTGGCGGGAGTCGGGCAT 3 9060 
ARRPGRTLS IAWTDWRESGM 

3 9061 GTGGGCCGCCGCCCAGCGCCGTCTGACCGAGCGCTACGGCACCGGCGCCGACCTGCCCGT 3 9120 
WAAAQRRLTERYGTGADLPV 

3 9121 ACCGCCCGGGGGCGACCTGCTCGGCGCGATCAGCCCCGAGGAGGGCGTCGACGTCTTCGC 3 9180 
PPGGDLLGAISPEEGVDVFA 

3 9181 CCGGCTGCTCGCCGCCGACACCGGCCCGAACGTCATCGTGTCGGCCCAGGACCTCGACGA 3 9240 
RLLAADTG PNVIVSAQDLDE 

3 9241 ACTCCTCGCGCGGCACGCGGCGTACACCACCGACGACCACCTCGCCGCCCTCGGCGACCT 3 93 0 0 
LLiARHAAYTTDDHLAALGDL 

3 9301 GAGGATCGCCGCCGCCCGGGACCGCTCCGCGCCCGCCGCGCCGTACGCGGCCCCCCACAC 3 93 6 0 
RIAAARDRSAPAAPYAAP HT 
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3 9361 GCCCGCCCAGCGGCGGATCGCCGGCTGGTACCGCGACCTGCTCGGCGTCGAACACGTCGG 3 9420 
PAQRRIAGWYRDLLGVEHVG 

3 9421 CCTCGACGACGACTTCTTCGCGCTCGGCGGGGACTCGCTGCTCGCCCTGCGCCTGCTGTC 3 948 0 
LDDDFFALGGDSLLALRLLS 

3 9481 G C AG C TG CGGG ACG C CTACGGGGTGGAG AT CTCCGT CGC C CG C ATGTT CG ACG AG CC C AC 39540 
QLRDAYGVE I SVARMFDEPT 

3 9541 GGTGGCGGCGCTGGCCGCCGCCACCGGCCCGCCGCCGGAAGAGACGCCCGGCCAGGAAGA 3 9600 
VAALAAATGPPPEETPGQEE 

3 9601 GGTGGTGCTGTGACCACGCCCCGCATCACCGACCTGCTCACCGAGCTCCGCGGCCGGCAG 3 9660 
V V L * 

MTTPRITDLLTELRGRQ (orf23) 

3 9661 GTGACCCTCACGGCCGACGGGGACCGGCTGCACTGCCGCGCGCCCCGGGGCGCGCTCACC 3 9720 
VTLTADGDRLHCRAPRGALT 

3 9 721 GACGAGCTCCTCGCCACCATCCGCGCCCGCCGCGACGAACTCCTCGCCCACCTGCGCGCC 3 978 0 
DELLATI RARRDELLAHLRA 

3 9781 GACCGCCGCATCCCGCGCCACGACGGGCCCGCGCCGCTGTCCTTCGCCCAGGAACGGCTC 3 984 0 
DRRI PRHDGPAPLSFAQERL 

^ 39 841 TGGCTCCTCCACCAGTTCCACCCGCACGACAGCGCCTACAACATCCCCCTGCACATCGCC 3 9 900 

!= J WLLHQFHPHDSAYNIPLHIA 

13 £ 3 9901 CTGCGCGGGCCCCTGAACCCGGCCGCCCTGCGCGCCGCCCTGGCCGAGGTGGTACGGCGG 3 9960 

Sit LRGPLNPAALRAALAEVVRR 

3 9961 CACGACGTCCTGCGCACCCGGTACGCCATCAGCCGCGGCCTGCCCCGGCCCGTCGTCGAA 4 0 02 0 
HDVLRTRYAI SRGLPRPVVE 

j:J J 4 0 021 CCGGCCCACACGCCGCCGCTGCCCCTGACCGACCTGACCGGGCTCCCCGCACACCACCGG 4 0 080 

! ^ PAHTPPLPLTDLTGLPAHHR 

|=3 4 0 081 GACGCCGAACTCGCCCGGCTGGCCGCCCAGGAGGCCAGGCGGCCCTTCGACCTCGCCCAG 4 014 0 

! = T DAELARLAAQEARRPFDLAQ 



n 



4 0141 GGCCCGGTGCTGCGGGCCCGGCTCCTCCGAACGGCCCCCGAGGAGCACCGGCTGCTGCTG 4 0200 
GPVLRARLLRTAPEEHRLLL 

4 0201 ACCCGCCATCACATCGCCAGCGACGGCTGGTCGCTCGACATCCTGCTCCGCGAACTGGGC 4 026 0 
TRHH IASDGWSLDILLRELG 

4 0261 ACGTTCTACCGGGCAGGGCGGGACGGCACACCCGCCGGCCTCGACGCCCTGCCGCTGCGG 4 032 0 
T FYRAGRDGT PAGLDAL P LR 

4 0321 TACGCCGACTTCGCCGCGTACCAGCGCGAACAGGCCGAACGGCCGGAGACGGCCGAGCGG 4 0 3 80 
YADFAAYQREQAERPETAER 

4 03 81 TCGACCCGCTGGGCACGGCACCTGAGGGGCGCCCCCGCGACACTCGACGTCCTCGGGCCC 4 0440 
STRWARHLRGAPATLDVLGP 

4 0441 CCGCCCGCCGAACCCTCCCACGCGCCGGCCGGCACCGTACGGACGGACCTTCCCGCCGCC 40500 
P PAE PSHAPAGTVRTDLPAA 

40501 CTCGTCACCGGCCTGCGGCAGCTGGGCGGCCGGGCCCGCACCACGCTCTTCCCGCTCCTG 4 0 56 0 
LVTGLRQLGGRARTTLFPLL 

4 0561 CTG AG CGC CTTCGG C CT CGC C CTGGC CGG C C CG C CCGGC C CGTACG ACGT CATGGT CGG C 4 062 0 
LSAFGLALAGPPGPYDVMVG 

4 0621 ATCCCCGTCGCCGGCCGGCCGCGCACCGAACTGGAGCCGCTCATCGGCTGCTTCGCGACC 40680 
I PVAGRPRTELEPLIGCFAT 

40681 ATCGCGCCGATGCGGCTGACGAGCGACGGGACCGAGCCGCTGACCCGGCTCGCCGCCCGC 4 0740 
IAPMRLTSDGTEPLTRLAAR 

40741 GCCCAGCAGCACGTCCAGGACGCGCTGGACGGACCCGACGTCCCCTTCGAGCGGCTCGTG 40800 
AQQHVQDALDGPDVPFERLV 
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IF] 



408 01 CACGCGCTGCGTCCGGAGCGGGACCTCGCGGAGAACCCCCTGTTCTCGGCGTCGTTCGCC 4 0 860 
HALRPERDLAENPLFSASFA 

4 0861 TTCCAGAACACCCCGCGGACCGCCGTGCGCCTCCCCGGCCTGGACGCCGAGGTGCTGCCC 4 092 0 
FQNTPRTAVRLPGLDAEVLP 

4 0 921 TCGCCGCCCGTGGCCCCCAAGTTCCCGCTGGCCCTCACCGCGACGGCGCGGGCCGACGGC 4 0 980 
S P PVAPKF PLALTATARADG 

4 0981 GGAATGGGCCTGGAGCTGGAGTTCGACCGGGACCGGATCGCCGAGCCGGTCGCGCGGGGG 41040 
GMGLELEFDRDRIAEPVARG 

41041 ATCCTCACGTCCTTCCACGCCGCCCTCGCCCGCGCGGTCGCCGACCCCGAGGCCCGGGCG 41100 
I LTS FHAALARAVADPEAPA 

41101 GCGCCCGTACCGGCCGCCGCCGTGGACCGGCGGCCCGGGCGCGAAGGACACGAGTGCCTC 41160 
APVPAAAVDRRPGREGHECL 

41161 CACGAGCCGGTGGCGCGGGCGGCGGCACGCCACCCCGACGCCGTCGCCGTCAGCTGCGGC 4122 0 
HE PVARAAARH PDAVAVS CG 

41221 GGCACCCAGCTCAGCTACGGGGCGCTCGACACCCGCGCCGAACGGCTGGCCGCGGTGCTG 4128 0 
GTQLSYGALDTRAERLAAVL 

41281 CGCGCCCACGGCGCCGGCCCCGAGCGGCTGGTGGCCCTGTGCCTGCCCACCGGCCCCGAA 4134 0 
RAHGAGPERLVALCLPTGPE 

41341 TGGGTCGTCGGCGCCCTCGCCATCCTCAAGTCCGGCGCCGCCTACCTGCCGCTCGACCCC 414 00 
WVVGALAI LKSGAAYLPLDP 

414 01 GGCGACCCGGCCGAGCGCCGCGCCTCCGTCGCCGCCGACGCGGGAGCGACGCTGATCGTC 414 60 
GD PAE RRA S V A AD AG A T L I V 

414 61 TCCGACACCGCGCTTCCCCCGCTCCACCGCGTCGACGTCACGGCCACCCTCCCGGACGGC 4152 0 
SDTALPPLHRVDVTATLPDG 

41521 GCCCCCGAGCCCACCGCCCGGGCCGTCCTGCCCGGCAACCTCGCCTACGCCGTCTACACC 41580 
APEPTARAVLPGNLAYAVYT 

41581 TCCGGCTCCACCGGCGGCCCCAAGGGCGTGCTCGTCACCCATGCCAACGTCACCGGGCTC 4164 0 
SGSTGGPKGVLVTHANVTGL 

41641 CTGGCCGCGTGCCGTGAGGCCCTGCCCGCCCTGGACGCCCCCCGGACCTGGTCGGCGACC 41700 
LAACREAL PALDAPRTWSAT 

417 01 CACTCGCCGGCCTTCGACTTCTCCGTCTGGGAGGTCTGGGGCCCGCTGACCGCCGGCGGA 41760 
HSPAFDFSVWEVWGPLTAGG 

41761 CGCCTCGTCCTCGTGCCCCCGGACGTGGCCCGGGCCCCGGACGAACTGTGGGACACCCTC 41820 
RLVLVPPDVARAPDELWDTL 

41821 CGCGACGAACAGGTCGAAGTCCTCAGCCAGACCCCCAGCGCGTTCCACCACCTCCTGCCC 418 80 
RDEQVEVLSQTPSAFHHLLP 

41881 ACCGCCGTGCGCCGGGCGGCCCAGGCCACCGCGCTCGAACTCGTCGTCCTGGGCGGCGAG 41940 
TAVRRAAQAT ALELVVLGGE 

41941 GCGTGCGAGCCCGCCCGTCTGACGCCTTGGTGGGACGCCCTGGGCGACCGGCGCCCGGCC 42000 
ACEPARLTPWWDALGDRRPA 

42001 GTGGTCAACATGTACGGCATCACCGAGAACACCATCCACGTCACCGTCCGCCGGATGACG 42 060 
VVNMYGI TENTIHVTVRRMT 

42 061 GCGGCGGACCGGTCGGGCAGTCCCGTCGGCCGGCCGCTGCCGGGGCAGCGCGCCGACCTT 4 2120 
AADRSGS PVGRPLPGQRADL 

42121 CTCGACCCCCACGGCCGGCCCGTCGCGCCGGGCGGGCGGGGCGAACTGTTCGTCGGCGGC 4 2180 
LDPHGRPVAPGGRGELFVGG 

42181 GTCGGACTGGCCCGCGGCTACCTCGGCCGGCCCGGCCTCACCGCCCGGAGCTTCCTGCCG 4 224 0 
VGLARGYLGRPGLTARSFLP 

42241 G ACG ACAC C C CCGGCTGG C CGGGCG CGCG CCG CT AC CGCT C CGGAGACCTGG CC CGG CTG 4 23 00 
DDTPGWPGARRYRSGDLARL 
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423 01 CTGCCCGACGGCGGCCTGGACTACGCGGGCCGCTCCGACGCACAGGTCAAGGTCCGCGGC 42360 

LPDGGLDYAGRSDAQVKVRG 

4 2 361 TACCGCGTCGAGCCCGCCGAGACCGAAGCCGCCGCGCTGACCCATCCCGCCGTGCGCCAC 424 2 0 
YRVEPAETEAAALTHPAVRH 

4 24 21 TGCGTGGTCGTGCCACGCGGCGACGGCGACCGGCGCCATCTCGCGGCGTACGTCGTCGCC 42480 
CVVVPRGDGDRRHLAAYVVA 

424 81 GACACCCGCGCCTGCGACGGGCCCGGGCTCCGCACCCACCTGGCCGAGCGGCTGCCCCGC 42 540 

DTRACDGPGLRTHLAERLPR 

42 541 CACCTGGTGCCGGCCTCGGTGGTCTTCCTGAAGCGGATCCCGCTGACCCGCAACGGCAAG 42600 
HLVPAS.VVFLKRI PLTRNGK 

42 601 CTCGACGTGGCGGCCTTGCCCGACCCGGCCGCCCACCGCGCACCCGCCCGCGAACGCCCG 4266 0 
LDVAALPDPAAHRAPARERP 

4 2 661 CGCACCGCGACCGAACGGACCCTCACCCGGCTGCTCGCCGCCCTCCTGAAGGCGCCACCG 4 2 72 0 
RTATERTLTRLLAALLKAPP 

42 721 GAGACCATCGGGACGCACGACAACCTCTTCGACCTGGGCGGCGACTCCCTGACGGTCACC 4 278 0 
ETIGTHDNLFDLGGDSLTVT 

427 81 CAGTTCCACTCCCGGGTGGTGGAGGAGTTCGCCGTGGACCTCCCGGTGCGCCGGGTCTAC 4284 0 
QFHSRVVEEFAVDLPVRRVY 

42 841 CAGGCCCTCGACATCGCGACGCTCGCCGTGACCGTGGACGACTTCCGGCGCCGCGCCGAA 42 90 0 
QALDIATLAVTVDDFRRRAE 

42 901 CGCACCGCGGTACTGCGCGCCCTCGCGGCGGCGGAGGCGATGGAACCCGGCGGTACGGCG 4 2 960 
RTAVLRALAAAEAME PGGTA 

42 961 GGGGAGT C CGG CGGT AAT C CGG AGG AGT C CG C CG CT ACGG CG CGGGGG CCCGCCGTCGCG 4 3 020 

GESGGNPEESAATARGPAVA 

4 3 021 GCGAACGAACCCGGCGCTGCGGCGCGTGAGTCCGGCGCCGCGCCGGTGGAGCCCGCCGTC 4 3 08 0 
ANE PGAAARESGAAPVEPAV 

43 081 GCAGTACAGGAGTCCGCCGCTACGAAGGGGGAGCCCGGCACCGCAGCGAATGAACTCGGC 4 314 0 

AVQESAATKGEPGTAANELG 

43141 GCTGAGGCACGGGAGCCCGGCACCGCAGCGCAGGAACCCGGCACCGACCCCCGGCCACCC 4 32 00 
AEAREPGTAAQEPGTDPRPP 

43 2 01 GCCGCCACACCGCAGGACCCCCGCACCACACCGCAGGAAGGACAGCCGTGCCCGCGTCCC 4 3260 
AATPQDPRTTPQEGQPCPRP 

43 2 61 GAATGAGCCGGCCGGCCGGCATCGTCGACATCGCGCGCCGTCACGCCGAGCGCACCCCCG 43 32 0 

MSRPAGIVDIARRHAERTPA (orf22) 
E * 

433 21 CCCGTCCCGCGTACGCGTTCCTGCCCGACGGCGAGACGGAGAGCGTCCGCTTCTCCTTCG 4 3 38 0 
RPAYAFLPDGETESVRFSFA 

43 381 CCGACATCGACCGGCGGGCCCGCGCCGTGGCCGCCGTCCTCCAGGACCGCGGCCTGGCCG 4 344 0 
D I DRRARAVAAVLQDRGLAG 

43441 GGGAGCGGGTCCTGGTCGCCTATCCCTCCGGGCCCGAGTACGTCCAGGCGTTCCTGGGCT 4350 0 
ERVLVAYPSGPEYVQAFLGC 

43 501 GCCTGTACGCGGGCGTGGTCGCCGTCCCCTGCGACGAGCCGCGCTCCGGCCCGAGCGCGG 43 560 
LYAGVVAVPCDEPRSGPSAE 

4 3561 AACGGCTCGCCGGGATCCGCGCCGACGCCCGCCCCGCCCTGGCCCTGACCGCCGGCGCCC 43 62 0 
RLAGI RADARPALALTAGAP 

43621 CCGAGGCCGGGCTCGCCGGCCTGGCCACCCTGGACGTGGCCGGCGTCCCCGACTCCGCCG 43 680 
EAGLAGLATLDVAGVPD SAA 

43681 CCGGGGCCTGGACCGACCCCGTCGCGGGACCGGACGCCCTGGCCTTCCTCCAGTACACCT 4374 0 
GAWTD PVAGPDALAFLQYTS 
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43741 CCGGATCGACCCGCCGCCCCCGCGGCGTCATGGTCGGCCACGGCAATCTGCTGGCCAACG 43800 
GSTRRPRGVMVGHGNLLANE 

43 801 AGCGCTGCATCGCCGCCGCCTGCGGCCACGACCGGGACTCCACCTTCGTGGGATGGGCGC 43860 
RCIAA ACGHDRDSTFVGWAP 

43 861 CGTTCTTCCACGACATGGGCCTGGTCGCCAACCTCCTCCAGCCCCTCTACCTCGGGTCCC 4 3 92 0 
FFHDMGLVANLLQ PLYLGS L 

43 921 TGTCGGTGCTGATGCCGCCGATGGCCTTCCTCCAGCGCCCGGCCCGCTGGCTGCGGGCCG 4 3 98 0 
SVLMPPMAFLQRPARWLRAV 

43 981 TCTCCCGCTACCGGGCGCACACCAGCGGCGGCCCCAACTTCGCCTACGACCTGTGTGTCG 44 04 0 

SRYRAHTSGGPNFAYDLCVD 

4 4 041 ACCGGGTCGGCGAGGACGAGCGGGCCGGACTGGAC CTGTCGGGCTGGAAGGTCGCCTACA 44100 
RVGEDERAGLDLSGWKVAYN 

4 4101 ACGGCGCGGAACCTGTACGGGCCGACACCCTGCGACGGTTCACCGACCGCTTCGCCCCCC 44160 
GAE PVRADTLRRFTDRFAPH 

44161 ACGGCTTCACCCCCGGCGCGCACTTCCCGACCTACGGGCTCGCCGAGGCGACCCTGCTCG 442 2 0 
GFTPGAHFPTYGLAEATLLV 

44221 TCGCCACCGGCCCCAAGGGAGTGCCGCCCCGCACCCTGACCGCCGACCGCGCCGCCCTGC 442 80 
ATGPKGVPPRTLTADRAALR 

442 81 GCGCCGGCCGGCTCCGGCCCGCCGGGCCCGGCGAGGCCGGCCTGGAACTGGTCGGCAACG 44340 
AGRLRPAGPGEAGLELVGNG 

44341 GCACCGCCGGCCTCGACACCACCCTCCGGATCGTCGACCCCGCGACCGCGCGGGAGTGCC 444 0 0 
TAGLDTTLRIVDPATARECP 

444 01 CGCCCGGAGAGGTCGGCGAGGTCTGGGTGCGCGGCCCGGGCGTGGCACGCGGCTACTTCG 44 4 6 0 
PGEVGEVWVRGPGVARGYFG 

4 4 4 61 GCCGCCCGCGCGAGTCCGCGCCGCTGCTCGCCGCCCGCCTGCCCGGCGGCGAAGGACCGT 4 4 52 0 
RPRESAPLLAARLPGGEGPY 

44 521 ACCTGCGGACCGGGGACCTGGGCGCCCTGCACGACGGGGAACTCTTCCTCACCGGACGCC 44 5 80 

LRTGDLGALHDGELFLTGRH 

44581 ACAAGGACCTCATCGTCATCCGCGGCCAGAACCACCACCCGCACGACCTCGAACGGACCG 44 64 0 
KDLIVI RGQNHHPHDLERTA 

44641 CCGAGCAGGCCCACCCGGCGCTCCGCCCGACCTGCGCCGCCGCGTTCGCGGTGCCCGGGG 44 7 00 
EQAHPALRPTCAAAFAVPGD 

44701 ACGGCGCGGAGCGGCTCGTGCTCGTCTGCGAACTCACCTCCTACCGCGCCGTCGACCCGG 44 760 
GAERLVLVCELTSYRAVDPA 

44761 CCGCCGTCGCCGAGGCCGTCCGGGCCGCGCTCGCCGCGCGGCACGGCGTCGCCCCGCACA 44 8 2 0 
AVAEAVRAALAARHGVAPHT 

44821 CG CTGGTGGTG C TGCG CCGCGG CGGC ATC C C C AAGAC CAC C AG CGGAAAGGTG CGG CG CG 44 88 0 
LVVLRRGGI PKTTSGKVRRG 

44 881 GCCACTGCCGGACGGCCTACCTCGACGGAACGCTCCCCGTTCACACGGCCGTCCGCCTCC 44 94 0 
HCRTAYLDGTLPVHTAVRLP 

4 4 941 CGGCGGGGGAGGAGGGCACCGAGGCCCTTCCCCTGACCACGGACCCCGGTCGGCTGGCCA 45000 
AGEEGTEALPL.TTDPGRLAT 

4 5001 CGGCGCTGCGCGACCTGGCCGCCGCCCACGCGGGCCTGGCCGGGCCCCTCCCCGGCACCG 4 5060 
ALRDLAAAHAGLAGPLPGTD 

4 5061 ACGAGCCGGTGAGCGCCCTCGGCCTGGACTCGCTCGCCTCCCTGCGGCTCCACCACCACG 4512 0 
EPVSALGLDSLASLRLHHHV 

4 5121 TCCAGTCCGCCTACGGCGTGACCCTGCCCGTCACCGCCCTGCTCGGCGACACCACTTACC 45180 
QSAYGVTLPVTALLGDTTYR 

4 5181 GCCGGCTCGCGGAGCTGACGCTCGCCGCCCCCCGCCCGGCCCGGGCGCCCGAGGGGCAAG 4524 0 
RLAELTLAAPRPARAPEGQV 
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45241 TCACCGGCGTCTGGCGGCCGTTGACGCACGGGCAGCGCGCCCTGTGGTACGAACAGGCGC 4 53 00 
TGVWRPLTHGQRALWYEQAL 

453 01 TCGCCCCGCACGCGGCCGCCTACCACCTCGTCCGCGCGCTGGCCCTCCGCGGCCCCGTCG 45360 
APHAAAYHLVRAL ALRGPVD 

4 53 61 ACGAGGAGGCCCTCGCCGAGGCGGTCCGCCGCGTCGTCCGCCGCCACCCCGCCCTGCGGA 4 542 0 
EEALAEAVRRVVRRHPALRT 

4 5421 CCCGCTTCGCGCTCCGCGACGGCGAACCGGCGCGCCGGACCGAGCCGTACGGACCGGAGC 4548 0 
RFALRDGEPARRTEPYGPEL 

4 5481 TGGACGTACGCGACGCCACCGGCCTGCCGGCGGACCGGCTCCGCGAACACCTGGCCGCGG 45540 
DVRDATGLPADRLREHLAAA 

4 5541 CGGGCGACCGCCCCTTCGACCTGGCCGCCGGCGACAGGCCCGTGAGGCTGACGCTCTACC 4 56 00 
GDRPFDLAAGDRPVRLTLYR 

4 5601 GCACGGACGGCGGCCACATCCTGCTGCTGGTCGCCCACCACCTGGTCGCCGACTTCTGGT 4 5660 
TDGGHI LLLVAHHLVADFWS 

45661 CCCTCGTCGTCCTCCTGGGCGACCTCGCCCGGGCCCACGCGGGCGAGGACCTGCCGCCCG 4 572 0 
LVVLLGDLARAHAGEDLPPA 

4 5721 CGCCGGAGGGGGACCCCGGCGACGAGGCGACGGACGCGGACCGGACGTACTGGCGGCACC 4 578 0 
PEGDPGDEATDADRTYWRHR 

l M 457 81 GGCTCGCCGACGCGCCACCCGCCCTCGACCTGCCCACCGACCTCCCCCACCCCGCCGAGC 4 5 84 0 

:s p LADAPPALDLPTDLPHPAER 

^ 4 5841 GCGGCTTCGCCGGCGCCACCCACGCCTTCCGGCTGCCCCCGGACCTCACCGCCCGGCTGA 45 900 

^ GFAGATHAFRLPPDLTARLT 
'A ...... 

I'f§ 45901 CCGCCCTCTCCCGGGAACGGCACTGCACCCTCTTCACCACCCTCCTCGCCGCCCACCAGC 4 5 960 

j!ss ALSRERHCTLFTTLLAAHQL 

; s 45961 TACTGCTCCACCGCCTGACCGGGCAGGACGACCTCGTCGTGGGCACCCTCCTCGCCCGCC 4 6 020 

Q LLHRLTGQDDLVVGTLLARR 

j)^ 46021 GCGACACCGCCGAAGCGGCCGGCGCCGTCGGCTACCTGGTCAACCCGCTGCCGCTGCGCT 4 6 08 0 

Q DTAEAAG AVGYLVNPLPLRS 

460 81 CCGTACGGGAGCCGGGGGAGACCTTCACGGAACTGCTGCGCCGCACCCGGCGGACCGTGC 4614 0 
;;.t? VREPGETFTELLRRTRRTVL 

I* !i 

46141 TGGACGCGGTCGCGCACGGCCGCCACCCCTTCGGGCCGCTCGTCTCCCGTCTCGCCCCCG 4 6200 
DAVAHGRHPFGPLVSRLAPA 

4 62 01 CGCGCACGCCCGGCCGCGCGCCGCTCCTGCAGAGCCTGTTCGTGCTCCAGCGCGAGTACG 4 626 0 
RTPGRAPLLQSLFVLQREYG 

4 62 61 GCGACGAGGCGGACGGGTACCGCGCGCTCGCCCTGGGCGTCGGCGGCCGGCTGCGCGTCG 4 63 20 
DEADGYRALALGVGGRLRVG 

4 63 21 G CGG AC T CG AC CTGG AGG CACT CG CGT TGC CG CG C CGCTGGT CGC AG CT CGAC C T CT CGC 4 63 8 0 
GLDLEALALPRRWSQLDLSL 

4 63 81 TGAGCATGGCGCGGCTCGGGGACGGGCTGACGGGGGTGTGGGAGTACCGCACCGACCTGT 4 644 0 
SMARLGDGLTGVWEYRTDLF 

4 64 41 TCACCGAGGCCACGGTCGCGGAGCTGAGCGAGGCGTTCGTCCACCTGCTGCGGGCGGCCG 46500 
TEATVAELS EAFVHLLRAAV 

4 65 01 TCGAGGACCCGGGCGCGCCCGTGGAGACGCTGCCGCTCACCGGCGGCCGGGAGACCGGGC 4656 0 
EDPGAPVETLPLTGGRETGP 

4 65 61 CGCGCCGCGGCCCGTCGGCGGCCCGGCCCGCCCTCCCGCTGCACCGGCTCGTGGCCGCGG 4662 0 
RRGPSAARPALPLHRLVAAA 

4 6621 CGGCGCGCCGCGATCCCGCACGGACGGCGGTCGTCGCACTCGCCCCGGACGGCACCGCCC 466 80 
ARRDPARTAVVALAPDGTAH 

4 6681 ACCACATCAGCCACGGAGCCCTGCACCGCGCGGCCACCACCCTCGCCGCCCGGCTCCGCC 4 674 0 
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n 



HISHGALHRAATTLAARLRR 

4 6741 GGGAGGGCGCCGGCCCGGAGCGGCCCGTCGCCGTGCTCGTCGAGCGGGGCCCCTGGCTGC 46 800 
EGAGPERPVAVLVERGPWLP 

4 6801 CCGTCGCCTACCTCGGCATCCTGCACGCCGGGGCCACCGTGCTGCCCCTGGACCCGGAGG 4 6860 
VAYLGI LHAGATVLPLDPED 

4 6861 ACCCCCCGCACAGGCTCGCCCGGACGATCGCGAACTCGGGGGCGCGGCTGCTGCTCACCG 4 6920 
PPHRLARTIANSGARLLLTE 

46921 AGACCGGGACCGCCTCGCGCGCGGCCGAGGCGGCCGGTCCCGGCGTACGCGCGCTGACCG 4 6 980 
TGTAS RAAEAAGPGVRALTV 

469 81 TGCGTGAGGGTGCCACCGGCGGCGAGCGGTTCTCGGCGGACGTCCACCCCGAGCAGTCCG 4 704 0 
REGATGGERFSADVHPEQSA 

4 7041 CGTACCTGCTGTACACCTCCGGGTCGACGGGCGACCCCAAGGGCGTGCTCGTCCCGCACC 4710 0 
YLLYTSGSTGDPKGVLVPHR 

4 7101 GGGCCATCGTCAACCGCCTCCTGTGGATGCAGGAGACCTACCGGCTGCGCCCGGGGGAGC 4 716 0 
AI VNRLLW MQETYRLR PGER 

4 7161 GGGTCCTGCACAAGACGCCGGTGACGTTCGACGTCTCGATGTGGGAGCTGCTGTGGCCGC 4722 0 
VLHKTPVTFDVSMWELLWPL 

4 72 21 TGACCGCCGGGGCGACCGTCGTCATGGCCCGGCCCGGGACCCACCGCGACCCCGCGCGAC 4 72 80 
TAGATVVMARPGTHRDPARL 

4 72 81 TCGTCCGGCGGATCGCCCGCGAGGCCGTCACCACCGTGCACTTCGTCCCCTCGATGCTCA 47340 
VRRIAREAVTTVHFVPSMLT 

4 7341 CCCCGTTCCTCACCGAGCTCGCCCGCGGCACGACGCGGCTGCCCGCGCTGCGGCGCGTGG 474 00 
PFLTELARGTTRLPALRRVV 

474 01 TGTGCAGCGGGGAAGAGCTGCCCGCGGCCGCGGTGAACCGCGCCGCCGGACTCCTCGACG 4 74 60 

CSGEELPAAAVNRAAGLLDA 

4 74 61 CCCGGCTGTACAACCTCTACGGCCCGACCGAAGCCGCCGTCGACGTCACCGCCTGGCCCT 47 520 
RLYNLYGPTEAAVDVTAWPC 

47521 GCCGCCCGCCCGAGCCGGGGCCGGTGCCGATCGGCCTGCCCATCGCCAACACCACCACCG 47 580 
RPPEPGPVPIGLPIANTTTE 

475 81 AGGTCCTCGACGGCCGGCTGCGCCCGCTGCCCCGCCCGGTGCCCGGCGAGCTGTACCTGG 4 7 64 0 

VLDGRLRPLPRPVPGELYLG 

47641 GCGGCGCCTGCCTGGCCCATGGCTACCACCACGACCCGGCCCTGACCGCCGCGCGCTTCC 4 7 70 0 
GACLAHGYHHD PALTAARFL 

477 01 TTCCGGCCCCCGGCGGCGGGCGCCGCTACCGCACCGGGGACCTCGTCCGCCAACGGGCCG 4 7760 

PAPGGGRRYRTGDLVRQRAD 

4 7761 ACGGGGCACTGGTGTTCCGGGGACGCACGGACGACCAGGTGAAGATCGGCGGCATCCGGG 47 82 0 
GALVFRGRTDDQVKIGGIRV 

47 8 21 TCGAGCCCGGCGAGGTGGCGGAGGCGCTTCGGGCCCTGCCCGGCGTCGCCGACGCCGCGG 4 7 880 
EPGEVAEALRALPGVADAAV 

478 81 TCGTCCCGCACGACGGGCGGCTGGCGGCGTACGCGGTCGCCGACCCGGTCGGCCCGGCCC 47 940 

VPHDGRLAAYAVADPVGPAP 

47941 CGGCGGCGGACGCCCTGCGGGACGCGCTGCGCAGGCGGCTGCCCGGCCACCTGGTGCCCG 48000 
AADALRDALRRRLPGHLVPA 

48001 CCGCCCTCACCCTGCTGGACCGGCTGCCCCTCACCCCGGCGGGCAAGCTCGACCGCCGGG 48060 
ALTLLDRLPL.TPAGKLDRRA 

4 8061 CGCTGCCCCACCCGTCGGCCCCGCCCCCGGACGGCGGACGGCCGCCCACGACCGGGACCG 48120 
LPHPSAPPPDGGRPPTTGTE 

4 8121 AACGGCTCGTCGCCCGGGTGTGGGCCGAACGCCTCGGACGGGAAGTCGTCGGCGTGGACC 4 8180 
RLVARVWAERLGRE VVGVD R 
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48181 GGGACTTCTTCTCCCTGGGCGGCGACTCCGTCCGGGCCCTCGGCGTGACGGCGGCCCTGC 4 824 0 
DFFSLGGDSVRALGVTAALR 

4 8241 GCGCCGCCGGGCTCCCGGTGACGGTCACCGACCTCCTGCGCCTGCCCACCGTGGCCGCCC 4 83 00 
AAGLPVTVTDLLRLPTVAAL 

4 83 01 TCGCCCGCCACGCCGACGAGCGGGCGGATCGCCGACCGGCGCGACAGGAGACGCCCCCCG 4 836 0 
ARHADERADRRPARQETPPG 

4 83 61 GGCCGTTCGCCCTCTGCCCGGAAGCCGCCGGCGTGCCCGGCCTGGAGGACGCCTACCCGA 4 842 0 
PFALCPEAAGVPGLEDAYPM 

4 8421 TGTCGATGGCCCAGCGGGCCGTGCTCTTCCACCGTGACCACAACCCCGGCTACGAGGTCT 4 84 80 
SMAQRAVL FHRDHNPGYEVY 

4 84 81 ACGTCACCAGCGTCGCCGTCTCCACGCCCCTGGACCGCACACGGCTCGCCGCGGCCGTGG 4 854 0 
VTSVAVS T PLDRTRLAAAVD 

4 8541 ACCGGCTGCTGGACCGGCACGCCTATCTGCGGTCCTCCTTCGACCTCGTGTCCCACCCGG 48600 
RLLDRHAYLRSSFDLVSHPE 

4 86 01 AGCCCACCCAGCTCGTCTGGACCCACCTGCCCACCCCGCTCGAGGTGGTGGAGTCGTCCG 4 86 6 0 
PTQLVWTHLPTPLEVVESSD 

4 8661 ACCCCGCCGGTTTCGACGCGTGGCTGCACGCCGAACGCAAGCGCCCCCTCGACGTCGGCA 4872 0 
PAGFDAWLHAERKRPLDVGT 

aaa. ...... 

s jJ 4 8721 CCGGACCGCTGGCCCGGTTCACCGCGCACGACGCGGGAGCCGCCGGATTCCGGCTGACCG 4 87 8 0 

vi! GPLARFTAHDAGAAGFRLTV 

%!''§ 4 8781 TCAGCAGCTTCGCCCTCGACGGCTGGTGCGTGGCCACCGTGCTCACCGAACTGCTCCGCG 4 884 0 

^ SSFALDGWCVATVLTELLRD 

M 4 8841 ACTACTGGTCCGCGCTGCGCGGCGCGCCCCTCAGCCTCCCGGCAGCCGCCGCCTCCTACC 4 8 90 0 

;rj YWSALRGAPLSLPAPAASYR 

U 4 8901 GCGAGTTCGTCGCCCTCGAACGCGCCGCCCAACACGATCCGGCGCACCGGGAGTTCTGGC 4 8 96 0 

EFVALERAAQHDPAHREFWR 

□ ...... 

\ 4 8961 GGACGGAGCTCGCCGGTGCCCGGCCGCATCCGCTGCCCCGCCGCCCGGTGCCACCGCCCG 4 9020 

^ TELAGARPHPLPRRPVPPPG 

s»# ...... 

M 4 9021 GGCCGGACGGGATCCGCCAGCACCGTCACGTCGTCCCCGTCGAGGACACCGTCGCCAAGG 4 908 0 

«=g PDGIRQHRHVVPVEDTVAKG 

□ 4 9081 GCCTGTCGGCGCTCGCCGGCGAGCTGGGTGTCGGGCTCAAACACGTTCTGCTCGGCGTCC 4 914 0 

LSALAGELGVGLKHVLLGVH 

4 9141 ACCTGCGGGTCGTCCGGGCCCTGTCCGGCGACCCCGACGTCATCACGGCCGTGGAGACCC 4 92 0 0 
LRVVRALSGDPDVI TAVETH 

4 92 01 ACGGCCGCCTCGAACGGCACGACGGCGACCGCGTCCTCGGGGTGTTCAACAACATCCTGC 4 92 60 
GRLERHDGDRVLGVFNNI LP 

4 9261 CGCTGCGGCAGCGGGTGGACGGCGGGAGCTGGGCCGACCTGGCCCGCGCCGCGCACGCCG 4 93 20 
LRQRVDGGS WADLARAAHAA 

4 9321 CGGAGGCGCGGACGGGGGAGTACCGCCGCTATCCGCTGGCCCAGGCACAGCGCGACCACG 4 93 80 
EARTGEYRRYPLAQAQRDHG 

4 9381 GCGCGGCCGGGCTCTTCGACACCCTCTTCGTGTTCACCCACTTCCACCTCTACCGCGCGC 4 944 0 
AAGLFDTLFVFTHFHLYRAL 

4 9441 TGGCCGACCTGGACGGCATGGCGGTCTCCGACCTGCGGGCCCCCGACCAG ACCT ACGT AC 4 9500 
ADLDGMAVSDLRAPDQTYVP 

4 9501 CGCTCACCGCCCACTTCAACGTCGACGCCACGGACGGCGGCGGCCTGCGGCTGCTGCTGG 4 9560 
LTAHFNVDATDGGGLRLLLE 

4 9561 AGTCGGACCCGCGGGAGTTCCCCGACGAGCAGGTCGCGGAGTTCGCCGCGTACTACCGCC 4 9620 
SDPREFPDEQVAEFAAYYRR 

4 9621 GCGCGCTGCGGGCCGCCGCCGACGCCCCGCACCGGCCGTACCGGGACACGCCGTTGACGG 4 9680 
ALRAAADAPHRPYRDTPLTD 
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4 9681 ACCGGCCGGCCGGTCCGGCGCCGCACCGCGCGGAGCGCTCCGTCCACGCCCTGTTCGCGG 4 9740 
RPAGPAPHRAERSVHALFAA 

4 9741 CCCCGGCC CGG AAC C AC C CGG AC CGG AT CG CG CT CG ACGG CGAGG ACGGGC CGGT CAG C C 4 9800 
PARNHPDRIALDGEDGPVSH 

4 9801 ACGGCGCCCTGGCCCGGCGCGCCGCCCGCCTCGCCGGAACGCTGCGGGCCGCGGGCGCCG 4 986 0 
GALARRAARLAGTLRAAGAG 

498 61 GGCCGGACACCGTCGTCGGGATCTGGGCGCCGCGCCGCGCCGACGCCGTCGTGGCGCTGC 4 992 0 
PDTVVGIWAPRRADAVVALL 

4 9921 TGGCCGCCCTCCACGCCGGAGCCGCCTACCTGCCCCTGGACCCGGTCCACCCGCCCCGGC 4 9 98 0 
AALHAGAAYLPLDPVH PPRR 

4 99 81 GGCAGCGGCAGGTGCTCACCGAGGCCGGCGCCCGCCTGCTCGT CCTGCCCGC CGGCCTCG 50040 

QRQVLTEAGARLLVLPAGLD 

50041 ACACCCCGCTCCGGGCCTGCGGCCTGCCCGTCGTGGCCCCGGACGACCTCGGCGCGCCCA 50100 
TPLRACGLPVVAPDDLGAP I 

5 0101 TCGCCCCCGTGTCCGTCCACCCGGAGCAGCTGGCGGCGGTCATGGCCACGTCCGGCTCCA 5 0160 

APVSVHPEQLAAVMATSGST 

50161 CCGGGACGCCCAAGACGATCGGCGTCCCGCAGCGCGCCCTGGCCGGCTACCTCCGCTGGG 5022 0 
GT PKT I GVPQRALAGYLRWA 

50221 CGATCGGCCACTACCGCCTCGACGAGGAGACCGTCTCCCCGGTGCACTCCTCGCTGGGCT 502 8 0 
IGHYRLDEETVS PVHS SLGF 

502 81 TCGACCTGACCGTCACCGCGCTGCTCGCACCGCTGGCCGCCGGCGGGCAGGCGCGGCTGA 50 34 0 
DLTVTALLAPLAAGGQARLT 

50341 CCGACTCCGGCGACCCGGGTGCCCTCGGCGCGGCACTGGCCGCCGGCCACCACACCCTGC 504 00 
DSGDPGALGAALAAGHHTLL 

504 01 TCAAGATCACCCCGGCCCATCTGGCCGCCCTCGCCCACCAGTTGGGCGCGCCGACCGCAC 504 60 
KITPAHLAALAHQLGAPTAL 

504 61 TGCGCACCGTCGTGGCCGGGGGCGAACCCCTGCACGCCGGCCACGTCCGCGCCCTCCGCG 5052 0 
RTVVAGGE PLHAGHVRALRA 

50521 CCTTCGCGCCCGGCGCCCGGCTCGTCAACGAGTACGGGCCGACCGAGACCACCGTCGGCT 50580 
FAPGARLVNEYGPTETTVGC 

50581 GCTGTGCCCACGACGTCGCACCGGACCCCGGCGAGGCGCCCATCCCCGTCGGTACCCCGA 50640 
CAHDVAPDPGEAPIPVGTPI 

50641 TCGCGGGCCTCAGCGCGTGCGTCGTCGACGACGCGCTGCCCGCACCGCCCGGCGTGCGGG 50 700 
AGLSACVVDDALPAP PGVRG 

507 01 GCGAGCTGTACATCGGCGGGACGGGCGTCACCCGCGGCTACCTGGGCCGGCCCGCGGCCA 50760 
ELYIGGTGVTRGYLGRPAAT 

507 61 CCGCCGCCGCCTACGTGCCGGACCCTGCCGCCCCCGGCGCCCGCCGCTACCGCACCGGCG 50820 
AAAYVPDPAAPGARRYRTGD 

50821 ACCTGGCACGCCGGCTGCCGGACGGCACCCTGCTCCTGGCGGGGCGCGCCGACCGCCAGG 5088 0 
LARRLPDGTLLLAGRADRQV 

50881 TGAAGATCCGCGGCCACCGGGTGGAACCGGGGGAGGTCGAGCAGGTGCTCGGCGGCCACC 50 94 0 
KIRGHRVEPGEVEQVLGGHP 

50941 CCGGGGTGCGGGAGGCGGCGGTCGTCGCCCACCCGGCACCCGGCGGCGGCCGCCGGCTGG 51000 
GVREAAVVAHPAPGGGRRLV 

510 01 TCGCGTACTGGGTACCGGCCGAACCGGCCCGGCCACCGTCCGCGGACGCGCTCACCGCGC 51060 
AYWVPAE PARPPSADALTAL 

51061 TGCTCGCCGACCGGCTGCCGCCGTACGCGGTCCCCGCCGAACTCGTCCGCCTGCCCGCCC 51120 
LADRLPPYAVPAELVRLPAL 

51121 TGCCCACCACCCCCAACGGCAAGGTCGACCACACCCGGCTGCCCGCGGCCGGACGGGACC 5118 0 
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PTTPNGKVDHTRLPAAGRDR 

51181 GGCGACTGGCGGAACTGCTCGACCGGATCGAGGCACTGTCCGACGCCGAGGCGGCCTCGG 5124 0 
RLAELLDRI EALSDAEAASA 

51241 CACTGCGCGACAGCCGGCCCGCACCCGGGAGTGGCGATGACCGAGCATGACGACCACCCG 513 00 
LRDSRPAPGSGDDRA* 

51301 CCGGCCCGCCGGGGCCCCGCCGGTTCCGCTGGCCCCGGCGGAAGCCCGCCCGTCCCGCAC . 513 60 

51361 GTGCCGGTGCCCGGGCATGACGACCGCGTCGGACGGCTGCCGGCGGACCGGAGCGTCCCG 51420 

51421 CCGACCCGCCGATTCTCTGGGGACCCCGCCGGTTCCGGTGGTGGCCCGCCCGTCCCGCAC 51480 

514 81 CCGGAGGTGCCGATGCGCGGGCATGACGACCGCGTCGGACGGCTGTCGGCGGACTGGAGC 51540 

MRGHDDRVGRLSADWS (orf21) 

51541 GTCCCGCCGACCCGCCTGCCCGCCGGGGACCCGGCCGGTTCCGTCGGCCCCGGCGGAGGC 51600 
VPPTRLPAGDPAGSVGPGGG 

51601 CCGCCCGTCCCGCACGAGGAGGTGACGATGTCGGAGTATGACGACCGCCTCGCGCGGCTG 51660 
PPVPHEEVTMSEYDDRLARL 

51661 TCGGACAACCAGCGCGCCCTGCTGGACCGCTGGCTCGCCGAGGACCCCGCCGGCGGTGCC 5172 0 
SDNQRALLDRWLAEDPAGGA 

51721 GGCCCGCTTCGCCCCGACGGCCGCCCGCCCCGCACCGAGGCCGAGCGGATCCTGGCCGGG 517 80 
GPLRPDGRPPRTEAERILAG 

'fcal ...... 

tI g 517 81 GTCTGGGAGGAGGTGCTGGAGACCGGCGGGATCGGCGCCGACGACGACTACTTCGCGCTC 5184 0 

>^a VWEEVLETGGIGADDDYFAL 

'HI 51841 GGCGGAGACTCCGTCCACGCCATCGTCATCGTGGCGAAGGCCCGGCAGGCCGGACTCGCC 51900 
UQ GGDSVHA IV I VAKARQAGLA 

m ...... 

^ - 51901 CTGACCGCCCATGACCTCTTCGAGGCCAGGACCCTCGCGGCCGTGGCGCGGAGAGCCGCC 5196 0 
^ LTAHDLFEARTLAAVARRAA 

j'3 51961 CCGGCCGGCCCCGCCGAGCCCGTCCCCGACGCGGGCGGCGGCGCGGTCCGGTACCCGCTG 52 02 0 

PAGPAE PVPDAGGGAVRYP L 

£3 52021 ACCCCTATGCAGCAGGGCATGCTCTACCACTCGGCCGGCGGCAGCACGCCCGGCGCCTAC 52 080 

\n TPMQQGMLYHSAGGSTPGAY 

52 081 GTGGTGCAGGTGTGCTGCCGGCTGACGGGGGACCTCGACGTGGCCGCCTTCCGCACCGCC 5214 0 
O VVQVCCRLTGDLDVAAFRTA 

52141 TGGCAGGCCGTGCTGTCCGCCAACCCGGCGCTGGCCGTCTCCTTCCACTGGTCCGACGGC 52200 
WQAVLSANPALAVSFHWSDG 

522 01 TCCCCGCCCGAGCAGGTGGTGGACCCCGACGCGCGCGTCACCGTCGACACGGCCGACTGG 52 2 60 
S PPEQVVD PDARVTVDTADW 

52261 CGGGACCGCACCCCGGCCGAGCGGGACGATGCCTTCGCCCGCTTCCTGGACACCGACCGC 52320 
RDRTPAERDDAFARFLDTDR 

52321 GCGGCGGGCTTCGACCTCGCCCGCGCCCCGCTGATGCGGCTGACGCTCTTCCGCGAGGGC 52380 
AAGFDLARAPLMRLTLFREG 

52381 GAGCACGCGTACCGCTGCGTGTGGACCCACCACCACCTCGTCCTCGACGGCTGGTCCCAG 52 4 4 0 
EHAYRCVWTHHHLVLDGWSQ 

52441 CAGCTCGTCCTGCGCGACGTCCTCGACTGCTACATGCGCCTGCGCGCCGGACGCGGCGCC 52 500 
QLVLRDVLDCYMRLRAGRGA 

52501 GAGCCGCCCGCCCGGCCGTCCTTCACCGGTCATCTGCGCCGGCTGGAGCGGCAGGACGGG 52 560 
EPPARPSFTGHLRRLERQDG 

52561 ATCGACGAGGAGTTCTGGCGCGACCACCTCGGCGGCCTGCCCGCACCCTCCCGCGTCGCC 52 62 0 
IDEEFWRDHLGGLPAPSRVA 

52621 GGTCCCGGCTGCCGCGACGGCCGGGTGGTCGCCGTACGGCGCGCCGAGCACCGGCACCGG 52 680 
GPGCRDGRVVAVRRAEHRHR 
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52 681 GTCTCCGCGGCGACGGGCCGGGAGCTGACCGGCTTCTGCCGCCGCCACGGGCTGACCCCG 
VSAATGRELTGFCRRHGLTP 

52 741 GCCGCCGTGCTGCACGGCGGCTGGGCGGTGCTGCTGTCGCTGCACTGCGGCCAGGACGAC 
AAVLHGGWAVLLS LHCGQDD 

52 801 GTGGTCTTCGGCACCACCCTCTCCGGCCGCCCCGAGGACCTGCCCGGCGTGACCGAGTGC 
VVFGTTLSGRPEDLPGVTEC 

52 861 GTCGGCCTCTTCATCAACACGCTTCCCCTGCGGGTCCGTTGCGGGGAGGACACGGACGTC 
VGLFINTLPLRVRCGEDTDV 

52 921 GTCGACTGGCTCCACGGCGTCCAAAGCGACCTGGCCGCCCTGTGGGACCACGCGCACGTc 
VDWLHGVQSDLAALWDHAHV 

52 981 CCGCTCAGCCGCGTcGAGCGCGGTCTCGGACTGGGCCGGgGCGGCGGGCTGTTCGACAGC 

PLSRVERGLGLGRGGGLFDS 

53 041 ATCATGGTCGTCGAGAACTTCCCCGCCGCCGTCGCCGACGGCCACGAGGCGGgCGGGCTG 

I MVVENF PAAVADGHEAGGL 

53101 CGGGTGACGGAGCCCCGGGCACTCGTCGACGAGGGCTACCCCCTCGTACTGGAGGCCACC 
RVTEPRALVDEGYPLVLEAT 

53161 ACCGGGGACCGGCCGGTGCTGCACGCCCGCTACGACCCCCACCGCCTCGCCGGCGGGCGG 
TGDRPVLHARYDPHRLAGGR 

53221 GTCCAGGCGCTGCTCGCCGCCTTCGACGACTACCTCCGGGCGGTGACCGCCGACCCGGCC 
VQALLAAFDDYLRAVTADPA 

53281 CGCCCGCTGCCGGACCTCCGCGCGGTCCTGGCCCGCGACCACGCGCGCCGGGACGGCGCG 
RPLPDLRAVLARDHARRDGA 

53 341 GCACGCGGGCGGCGCCGCGCCGCGGACCGCACCCGTCTGACGCTGGCCCGCCGCCGCCCG 
ARGRRRAADRTRLTLARRRP 

53 4 01 GCGACGACGACCGAGGGAGAGACACCGTGACATGGACCGTGGTGACCGGAGCCGGCGGCT 
ATTTEGETP* 

MTWTVVTGAGGF 

53 4 61 TCATCGGCTCCCACCTCGTACGCCGCCTCGTCCGGGACGGGCACCGGGTCCGCGGCGTGG 
I G S H LVR RLVRD GH RV RGVD 

53 521 ACCTGGTGCCGCCGCGCTACGGCCCCGGCGAGGCCCAGGAGTTCGTCATCGCCGACCTGC 
LVPPRYG PGEAQEFVIADLR 

53 581 GCGACGCGGCGCAGGCCGCGCGGGCCGTCGCCGGCGCGGACTCCGTCTTCGCGCTCGCGG 
DAAQAARAVAGADSV FALAA 

53 641 CCAACATGGGAGGCATCGGCTGGACCCACACCGCGCCCGCCGAGATCCTCCACGACAACC 
NMGGIGWTHTAPAE I LHDNL 

53 701 TGCTGATCTCCACCCACACCATCGAGGCATGCCGGGCCGCCGGCGTGCGCACCACCGTCT 
LISTHTI EACRAAGVRTTVY 

53761 ACACCTCCTCGGCCTGCGTCTACCCCGCGTCCCTGCAGCGCGAGCCCGACGCCGCGCCGC 
TSSACVYPASLQREPDAAPL 

53 821 TGGCCGAGGACCCGGTCTTCCCCGCGGAACCCGACATGGAGTACGGCTGGGAGAAGCTGA 
AEDPVFPAEPDMEYGWEKLT 

53 881 CCACGGAAATCCTGTGCGGCGCCTACCGCCGCAGCCACGGCATGGACATCAAGACAGCCC 
TEI LCGAYRRSHGMDI KTAR 

53 941 GGCTGCACGCCATCTACGGCCCCCTCGGCACGTACACCGGGCCCCGCGCGAAGTCCCTGT 

LHAIYGPLGTYTGPRAKSLS 

54 001 CGATGCTCTGCGACAAGGTCGCCCGGATACCCGGCGACGAGGGGGAGATAGAGGTCTGGG 

MLCDKVARI PGDEGE I EVWG 

54 061 GGGACGGGACGCAGACCCGCTCCTACTGTTACGTCGACGACTGTGTCGAAGGGCTGATCC 
DGTQTRSYCYVDDCVEGLIR 

54121 GGCTCGCCCGCTCCGACGTGGCGGAACCGGTCAACATCGGCTCCGAGGAGCGCGTCGACA 
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LARSDVAEPVNIGSEERVDI 

54181 TCGCGTCGCTCGTCGAGCGGATCGCCGGGGTCGCCGGGAAGAAGGTGCGCTGCGCCTTCG 5424 0 
ASLVERIAGVAGKKVRCAFA 

54241 CCCCCGACCGCCCGGTCGGGCCCCGCGGGCGCGTCTCGGACAACACCCGCTGCCGCGAAC 54300 
PDRPVGPRGRVSDNTRCREL 

54 3 01 TGCTCGGCTGGGCACCGGAGACGTCCCTCGCGGCCGGCCTGGAGCGCACCTACCCGTGGA 543 6 0 
LGWAPETSLAAGLERTYPWI 

54 361 TCGAGCGCCAGGTCCTCGCCGAGGCCGGGAGGGCCGATGCCTGAGCACCGCACACCGGTG 5442 0 

M (orfl9) 

ERQVLAEAGRADA * 

54421 AAGGACCTCGGCCGGCTGCTGCTCGGGCACGCCGCGCGCTTCCGGGGCCGCGAGCTGCAG 5448 0 
KDLGRLLLGHAARFRGRELQ 

54 4 81 GACGTCGCCACCCGGGCGCTGCGGGCCTCCGGCGGGGAGAACGCCTGGGTGGTGTCCGTC 54 54 0 
DVATRALRASGGENAWVVSV 

54 541 GTCAACACCAGTCTCCGCGCCCGCCAGGCCGTGGACCACGCGCTGCGGCTCGCCCCCCGC 54 60 0 
VNTS LRARQAVDHALRLAPR 

54 601 CGCGGGCTCTCCCGGCTGCGCTACCCGTTCTCCGCCGCCCACCACACGGCCACCCCGCCC 54 660 
RGLSRLRYPFSAAHHTATPP 

54 661 CGGACCCTGTCGCTGCTGTGCCCGACCCGCGAACGCGTCGGCAACGTCGAACGCTTCCTC 5472 0 
iB RTLSLLCPTRERVGNVERFL 

..... 

^ 54 721 GACAGCGTCGCCCGCACCGCCGCCGCGCCCGGCCGGATAGAGGCCCTCTTCTACGTCGAC 5478 0 

.1 DSVARTAAAPGRI EALFYVD 

M 54781 GACGACGACCCCCAACTCCCTGCCTACCACGAGCTGTTCGAGCACGCCCGGTGGCGCTAC 54840 

;h DDDPQLPAYHELFEHARWRY 

;as ...... 

^ 54 841 GGACGGATCGGCCGGTGCGCCCTGCACGTCGGCGCCCCCGTCGGCGTACCCCACGCCTGG 54 90 0 

; _ GR I GRCALHVGAPVGVPHAW 

^ 54 901 AACCACCTGGCCCGGAACGCGGCCGGCGACGTGCTGATGATGGCCAACGACGACCAGCTC 54960 

lZ NHLARNAAGDVLMMANDDQL 

s 

M 54 961 TACATCGACTACGGCTGGGACACCGCCCTCGACGCCCGCGTCACCGAACTGAGCGCCCTG 5502 0 

^ YIDYGWDTALDARVTELSAL 

3 5 5021 CACCCCGACGGCGTCCTGTGCCTGTACTTCGACGACGGCCAGTACCCCGAGGGCGGCTGC 55080 

HPDGVLCLYFDDGQYPEGGC 

5 5081 GACTTCCCGATGGTGACACGGCCCTGGTACGGCACCCTCGGCTACTTCACCCCGACGATC 5514 0 
DFPMVTRPWYGTLGYFTPTI 

55141 TTCCAGCAGTGGGAGGTCGAGAAGTGGGTCTTCGACATCGCCGACCGGCTGCACCGGCTC 5520 0 
FQQWEVEKWVFDIADRLHRL 

552 01 TACCCCGTCCCCGGCGTCCTCGTCGAACACCGGCACTACCAGGACTACAAGGCACCCTTC 5526 0 
YPVPGVLVEHRHYQDYKAPF 

552 61 GACGCCACCTACCAGCGGCACCGGATGACACGGGAGAAGTCCTTCGCCGACCACGCCCTG 5532 0 
DATYQRHRMTREKS FADHAL 

55321 TTCCTGCGCACCGAGCCGGACCGCGAGGCGGAGACGGACAGGCTGCGGGCCGTCATCGCC 553 8 0 
FLRTEPDREAETDRLRAVIA 

55381 CGGGCAGGGAACACCCCGGACGCCGACCACGCCGACCATGCCGTTCACGACGCGGAGACC 5544 0 
RAGNTPDADHADHAVHDAET 

55441 TTCTGGTTCACCGGCCTCCTGCGCGAGTCCCACGCCAAGCTGCTCGCGGAACTCGACGAC 5550 0 
FWFTGLLRESHAKLLAELDD 

55501 GCGCCGGGCCCGGCCGCCGGAGCCGTGCTCTTCGCCGACGGCTCCTGGACCGGCGTCGCC 55560 
APGPAAGAVLFADGSWTGVA 

55561 TACCGCACCCACCCGCTGGCCACCGCCCTGCTCGCCTCGATCCCCGAGGCCACCCTCGAC 55620 
YRTHPLATALLAS I PEATLD 
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55621 TCCGGCCGCGCCGACCTCCTCGTCGTCCCGCCCGGCGCGTCCCACCACCACCCCGACGGC 5568 0 
SGRADLLVVP PGASHHH PDG 

55681 ACCGTCGACTCCGCGTTCGGCTCCGACGCCGGCCTCCGCGTCCTGTTCGGACTGCGCGTG 5574 0 
TVDSAFGSDAGLRVLFGLRV 

55741 CCGGACGCCGCGCAACTCCGCGTCGGCGACGGCCCGGTGCCCTGGGGCAATGGGCAATGC 55800 
PDAAQLRVGDGPVPWGNGQC 

558 01 CTGATCCACGACACCGCCGCACCGAGCACCCTGCGCAACGACGGCACCGAATCTCTGGCC 55 860 
LIHDTAAPSTLRNDGTESLA 

558 61 GCCCTCACCTTCGTGGTGCCGCGCCCGGCACCGGGGGAGTGAGGCCCGTGTGCGGCATCG 5592 0 

ALTFVVPRP APGE * 

MRPVCGIV (orfl8) 

55921 TGGCGATCCGCTCCGCCGACGGCGGACTCGACGGCGGTGAACTCACCGCGCCGATGGCCG 55 980 
AIRSADGGLDGGELTAPMAD 

559 81 ACCTGCGCCCGCGCGGCCCCGACGGCGAAGGCACCTGGGTCTCGCCCACCGGCCGGGCCG 56 04 0 

LRPRGPDGEGTWVSPTGRAA 

56041 CCCTCGGCCACACCCGGCTCGCCGTGATCGCCCCCGACGCCGGACGCCAGCCGGTCGCCG 56100 
LGHTRLAV IAPDAGRQPVAG 

56101 GCCCGGACGGCACCGTCCGGCTCGTCGTCAACGGCGAGTTCTACGGCTACCGGGAGATCC 56160 
PDGTVRLVVNGEFYGYRE I R 

56161 GCGCGGAACTGCGCGCCGc CGGCTGCCGGTTCCGCACCGGCAGCGACAGCGAGATCGCCC 5622 0 
AELRAAGCRFRTGSDSEIAL 

56221 TCCACCTGTACCTGCGGGACGGCCGGCGGGCACTGGAGCGGCTGCGCGGCGAGTTCGCCT 562 8 0 
HLYLRDGRRALERLRGEFAF 

562 81 TCGTCCTCTGGGACGAACGCCGCGCCACCCTCTTCGCCGCCCGCGACCGGTTCGGCGTCA 5634 0 

VLWDERRATLFAARDRFGVK 

563 41 AACCCCTCTACTACACCGAGCGCGACGGGCGGCTCTACGTCGCCTCGACGGTCAGGGCCC 564 00 

PLYYTERDGRLYVAS TVRAL 

564 01 TGCTCTCCTGCGGCGCCCCCGCCCGCTGGGACACCGCCGCCTTCGCCGCGCACCTGCAGC 56460 

LSCGAPARWDTAAFAAHLQL 

56461 TCGGCCTGCCCCCCGACCGCACCCTCTTCGCCGGCATCCGGCAGCTCCCGCCCGGCTGCC 56520 
GLPPDRTLFAGIRQIiPPGCH 

56521 ACCTCATCGCCGACGCCCACGGCACCCGCGTCACCCCCTACTGGGACCTCGACTACCCGC 56 580 
LIADAHGTRVTPYWDLDYPP 

56581 CCGCCGGCGAACTCGCCGCCCGGGGAAGCCTGGACGACCACCTGGACGCGGTACGCGAAC 5664 0 
AGELAA RGSLDDHLDAVRER 

56641 GGACCGACGAGGCCGTACGGTTGCGTACCGTCGCCGACGTGCCCCTCGCCTGCCACCTCA 567 0 0 
TDEAVRLRTVADVPLACHLS 

567 01 GCGGCGGCCTGGACTCCTCCGCCGTCGCCGCCTCCGCCGCCCGCCACACCCGGCTCACCG 5676 0 
GGLDS SAVAASAARHTRLTA 

567 61 CCTTCACCGTCCGCTTCGACGACCCCGCCTTCGACGAGAGCGCCGTCGCCCGGCGCACCG 56 82 0 
FTVRFDDPAFDESAVARRTA 

56821 CCGCCCACCTGGCCATCGACCACCGCGAAGTCGCCTCGGAACGCGCCCACTTCGCGGACC 56880 
AHLAI DHREVASERAHFADH 

56881 ACCTGCGGGACGTCGTCCGCGCCGGCGAGATGGTGCAGGAGAACTCGCACGGCATCGCCC 5694 0 
LRDVVRAGEMVQENSHGIAR 

56941 GGTACCTGCACAGCGCGCACATCAAGAAGGCGGGATTCACCGCCGTCCTCGCCGGGGAGG 57000 
YLHSAH I KKAGFTAVLAGEG 

570 01 GCGGGGACGAACTGTTCCTCGGCTACCCCCAGTTCCGCAAGGACCTGACGCTCAGCCTGT 57 060 
GDELFLGYPQFRKDLTLSLS 
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57061 CCGCCGACGCCCGCGACAAGGCCGACCGCGGCTACGCCCGGCTGGTCGCGGCCGGGCTCC 5712 0 
A.DARDKADRGYARLVAAGLL 

57121 TGCCGCCGTACCTGCGCACCCTCCTCGGCACCCTCGGCTTCCTGCCCTCCTGGATCGTCG 5718 0 
PPYLRTLLGTLGFLPSWIVD 

57181 ACCGCCACCTGGCCGTCACCCAGCCCGTCGCCGCCCTGCTCCGCCCCGACTTCGCCGCCG 5724 0 
RHLAVTQPVAALLRPDFAAE 

572 41 AACTGGCCCGCGCCGACGCCGCCGCGCCCCTGCTCGCCGCCGGCGCCGGCCTGCTCGCCG 573 0 0 
LARADAAAPLLAAGAGLLAG 

57 3 01 GGCGCGCCCCGGCGCACCAGGCCACCTACCTCTTCGCCAAGACCTGGCTGCCCGGCTACC 573 60 
RAPA- HQATYLFAKTWLPGYL 

57 3 61 TGCTCGCCGCCGAACGCCTCGACGCGGCCCAGGCCGTCGAGGTGCGGCTGCCCCTCTTCG 5742 0 
LAAERLDAAQAVEVRLPLFD 

57421 ACCACCACCTCTTCGACCTCGTCCGGCACACCCCGCCGGCCTGGTACGACAAGGACGGCA 574 80 
HHLFDLVRHTPPAWYDKDGT 

574 81 CCGGCAAGTACCCGCTGCGGGCCGCCATGCGCCACCGGCTGCCGCGCGAGGTGACCGAGG 5754 0 
GKYPLRAAMRHRLPREVTEG 

57 541 GCCGCAAACAGGGCTTCCTCGCACCTCCGATGGCCGACGACGACACCCTCCTCGACGCCC 57600 
RKQGFLAP PMADDDTLLDAL 

57 601 TGCGCGAACGCCTCGCCGGACCGGGCGCGGGCGACGACCCCTTCTTCGACCCGCACGCCG 57 66 0 
RERLAGPGAGDDPFFDPHAV 

57 661 TCCGCGCCCTGCTGGACCGGCTGGCCGCCGCACCCCCGGGGCAGCGGTCCGGCGGCGAGA 57 72 0 
RALLDRLAAAPPGQRSGGEK 

57721 AACTCCTCCAACTCGTCGCGAGCACCGCCGAACTGGCCGACGAGTTCGGCCTCACCACCG 57 7 80 
LLQLVASTAELADEFGLTTA 

57781 CCCCCAGCGGGCAGAAAGGCGGCAACGGTGGCTGACCTCGATCCCGGCACGCTCTCCGAG 57 84 0 
PSGQKGGNGG* 

57 841 GCCGAGCTGACCGCCCGGATCGCCGCCCTGTCCCCCGAACGCCGGGCGGCGTTCGAGAAG 5790 0 

57 901 ATGCTGCACGGCGCCGCGCACCCCCGCCCCGGCATCCCGCGCCGCGGCGCCACCGCGGCA 57 96 0 

MLHGAAH PRPG I PRRGATAA (orfl7) 

57961 CCGGCCTCCTACGGCCAGGAACGCCTGTGGCTGCTCACCGGGCTGCTGCCCACCGCCTAC 58 02 0 
PAS YGQERLWLLTGLLPTAY 

58 021 AACTACGCCACCGCCCTGCGGCTGCGCGGCGACCTGTCCGTCCCCGCGCTGCGCGGCGCC 58 08 0 

NYATALRLRGDLSVPALRGA 

58 081 CTGCGCGGCATCGTCCGCCGCCACGAGGTGCTGCGCACCACCTTCCGGCTGGACGGCGAC 5814 0 
LRG IVRRHEVLRTTFRLDGD 

58141 GACCTCATCCAGGTCGTCCACCCCACGGCGGACGTCCCCGTGCGCCTGGCCGACCTCACC 582 0 0 
DL I QVVHPTADVPVRLADLT 

582 01 GGACGCTCCGCCGACACCGGGCGGCTGATGCGCGAGGAGGCCCGCCGCCCCTTCGACCTG 58260 
GRSADTGRLMREEARRPFDL 

58261 GAGCACGGGCCGCTGCTGCGGCTGACCCTCTTCCGGCTCGGCCCCCGCGACCACCTCGCC 5832 0 
EHGPLLRLTLFRLGPRDHLA 

58321 CTGCTGGCCGTCCACCACGCCGTCACCGACGGCTGGTCCAACGGCGTCCTCGTGACCGAA 583 8 0 
LLAVHHAVTDGWSNGVLVTE 

58381 CTCGCCACCGGCTACCGGGAACTGCGCGCCGGACGCCCCGACCGGCGGCCCGCCCCGCCG 5844 0 
LATGYRELRAGRPDRRPAPP 

58441 GTCCAGTACGGCGACTACGCGCACTGGCAGCGCGAGCGGCTGACCGGGCCCGAACTGCGG 58 500 
VQYGDYAHWQRERLTGPELR 

58501 G C C CTGG AGG ACT ACTGG CGC AC CGC CGT ACG CG AC CTG C C CAGG ACGG AC CTG CCCAC C 58560 
ALEDYWRTAVRDLPRTDLPT 
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58561 GACCGCCCCCGCCCCGCCGCCCGGCGCGGCGAGGGCGCCAACCACGCCCTGCTGCTCTCG 58620 
DRPRPAARRGEGANHALLLS 

58621 CCGGAGCTGACCGGCCGGCTCGCCGACCTGCGCCGCCGCGAGGGCGGGTCGCTGTTCATG 58 680 
PELT GRLADLRRREGGSLFM 

5 8681 CTCGTGCTCTCCGCGCTCCTGGTCGTCCTGCGTGGCACCGGCGGCCGGGACCGGCTCGCC 58740 
LVLSALLVVLRGTGGRDRLA 

58741 GTCGGCACCCTCGTCGCCGGCCGCACCCGCCCCGAACTCGAGCCGCTCATCGGCTACTTC 58800 
VGTLVAGRTRPELEPLIGYF 

58801 GTC7VACGTCCTGCTGCTGCCCTTCGAGACCGGCGGCCGGACCTCCTTCGCCGAGCTGTGG 58 860 
VNVLLLPFETGGRTSFAELW 

58861 CGGCGGGTCCGCGGCCGGCTGGTGGAGGCGTACGCCCACCAGGAACTGCCGCTGGAGAAG 58 920 
RRVRGRLVEAYAHQEL PLEK 

58921 GCCCTGGAGCTGCTGCGCGCCGACGGCACCGCCCCCGCCGACCCGCCGGTCGGCGTGGTC 58 980 
ALELLRADGTAPADPPVGVV 

5 8 981 TGCGTCGCCCAGCAGCCCGCCCCCGCGATCACCCTGCCCGGACTCGACGCGAGCGTCGAG 59 040 
CVAQQPAPAI TLPGLDASVE 

5 9041 GACGTCGACCTGGGCACCGCCCAGTTCGACCTCGTCGTCGAGGTGCGCGAACGGCCGGAA 59100 
DVDLGTAQFDLVVEVRERPE 

5 9101 GGCGTGCAGATCGCCTTCCAGTACGACCGGGACCTGTTCGACGCGGCCACGGTCCGGCTC 5916 0 
GVQ IAFQYDRDLFDAATVR L 

5 9161 CTCGCCGACCACGTGCACGCCGTCCTCGACCAGGCCGCCGCCGACCCCACCCTGCCCTGT 5 922 0 
LADHVHAVLDQAAAD PTLPC 

5 9221 GCCGAGCTGCCCGCCCCGCCGGCCCCCGCGGCCCCGGCCCGCACGGCCGGCGCCACGACG 59280 
AELPAPPAPAAPARTAGATT 

5 92 81 CTGCACGCCCTGTTCGAGTCCCGCGCCGCGAAGAGCCCCGACGCGGTCGCCCTCGTCGAC 5934 0 
LHALFESRAAKSPDAVALVD 

59341 GGCGGCCACCGCGTCACCTACCGGACCCTCAACACCCGCGCCAACCGGCTCGCCCGCCAC 594 00 
GGHRVTYRTLNTRANRLARH 

59401 CTGCGCGCGGTCGGCGTGCGTACCGAGGACCGGGTGGCGCTGCGCCTGCCCCGCGGCACC 59460 
LRAVGVRT EDRVALRL PRGT 

59461 GACGCGGTGACCGCCACCCTCGCCGCCCTCAAGGCCGGCGCCGCGTACGTACCCCTCGAC 59520 
DAVTATLAALKAGAAYVPLD 

5 9521 CCCGCCCTCCCCGAGGAACGGCTGACCCGCGTCCTCGCCGACGCCCGCCCCGCCGTGGTC 5 958 0 
PAL PEERLTRVLADARPAVV 

5 9581 CTCACCCCCGCGTATCTGCACGACCGGTCCGCCGAGATCACCGCCCACGCCGGCCATGAC 59 640 
LTPAYLHDRSAE ITAHAGHD 

59641 CTCAACCTCCCCGTCCACCCCGACAACCTCGCCTACCTCCTCCACACCTCCGGATCCACC 59700 
LNLPVHPDNLAYLLHTSGST 

59701 GGCACCCCCaAGGGCGTCCTCGGCAcCCACCGGGGCGCGGTCAACCGCGTCGACTGGATG 59760 
GTPKGVLGTHRGAVNRVDWM 

5 9761 AGCACCGCGTACCCGTTCCGGACCGGCGACGTGGCCGTCGCCCGCACCGCGCCCGGCTTC 5982 0 
STAYPFRTGDVAVARTAPGF 

59821 GTCGACGCGGTCTGGGAACTCTTCGGCCCCCTGGCCGCCGGCGTCCCCCTCGTCCTCCTG 59880 
VDAVWELFGPLAAGVPLVLL 

5 9881 CCGACCGACGAGGCGCGCGACCCGGCCCTGCTGACGGCGGCGCTGGAACGGCACCGGGTG 59 940 
PTDEARDPALLTAALERHRV 

59941 AGCCGGATGGTGACGGTCCCGTCGCTGCTGACCATGCTCCTGGACGAGTCCGCCCGCGCG 60000 
SRMVTVPSLLTMLLDESARA 

60001 ACGGACCTCGGCACCCGCCTGGCCTGCCTCCGCACCTGGATCACCAGCGGCGAGCCCCTG 60 060 
TDLGTRLACLRTWITSGE PL 
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60061 CCGCCCGCGCTCGCCCGGCGGTTCCACGACCGCCTGCCCGGCCGCACCCTGCTGAACCTG 6012 0 
PPALARRFHDRLPGRTLLNL 

60121 TACGGCTCCTCCGAGACCGCCGCCGACGCCACCGCGGCCCGCATCGACCCGGCGCCCGGG 6018 0 
YGSSETAADATAARI DPAPG 

60181 ACTGCGCTCCCGGAGCGGTCCCCGATCGGCACGCCCATCACCGGCGTCAGCGCCCTCGTC 6024 0 
TALPERSP IGTPITGVSALV 

60241 CGCGGCCCGGACCTGCGCCCGCTGCCCGCGCTGATGCCCGGCGAGCTGTACGCCGGGGGC 603 00 
RGPDLRPLPALMPGELYAGG 

603 01 GCGTGCGTGGCCCGCGGCTACCACGCCCGTCCGGCCGAGACCGCCGCGGCGTTCCCGCCG 60360 
ACVARGYHAR PAETAAAF PP 

603 61 GATCCCGACGGCGGGCCCGGCGCCCGGATGTTCCGTACCGGTGACAGGGCCCGGCTGCGG 6042 0 
DPDGGPGARMFRTGDRARLR 

60421 GCCGACGGCCGGCTGGAACTCCTGGGGCGCGTGGACCGGCAGGTGCAGATCCGCGGCCAG 604 80 
ADGRLELLGRVDRQVQIRGQ 

60481 CGCGCCGAGCCCGGCGAGGTCGAACACGCCCTGCTGGCCCACCCGGCCGTACGGGCCGCC 6 054 0 
RAE PGEVEHALLAH PAVRAA 

6 0541 GCCGTCACGGCGAACCCCGACGCCACCGGCCTGTGGGCGTACGTGCGGCTCGCTCCCGGC 6 06 0 0 
^ AVTANPDATGLWAYVRLAPG 

hQ 60601 CCGTTCGCCGCCGGCTCCCCCCAGACCGAGCTGACCGCCTTCCTGCGCCGCACGCTCCCT 6 066 0 

:c g PFAAGSPQTELTAFLRRTLP 

/ 1 60661 GCCCACCTCGTGCCCACCGCCGTCACCGTCCTGGACGAGCTGCCGGTGACCGCGCACGGC 60720 

*j AHLVPTAVTVLDELPVTAHG 

; =0 ...... 

m 6 0721 AAGACCGACCACGCGCGGCTGCCCGCCCCCGACCCCCGGGCCGGGCGCCCCGCCCCGACC 60780 

:!;= KTDHARLPAPDPRAGRPAPT 

* • 6 0781 GCCCCCCGCACCCCCACCGAGCGTACGGTCGCCGACGTCTTCGCCGGGGTGCTCGGCCTG 60840 

□ APRTPTERTVADVFAGVLGL 

60841 GAGGGGCCGGTCGGCGCGCACGACGACTTCTTCCTCCTCGGCGGGCACTCCCTCCTCGCC 60900 
KJ EGPVGAHDDFFLLGGHSLLA 

60 901 GCCCGCAGTCGCGGCGGAACTCCGCGCCCGCCGCGGCGTCCGGATCGGGCTGAGCGACGT 6096 0 
ARSRGGTPRPPRRPDRAERR 

60961 CTTCGCGGCCCCCACCGTCGCCGCAGCGTCGCCGCCCGGACCGACGCCGCCCGGCCCGGC 6102 0 
LRGPHRRRSVAARTDAARPG 

61021 ACCGGCCCCGAGCACACCCCGTTCGTCACCGACCCCGGCGCCCGGCACGAGCCGTTCCCG 6108 0 
TGPEHTPFVTDPGARHE PFP 

61081 CTCACCGACGTCCAGCGGGCCTACTACGTGGGACGCGAGGGCGGGTTCGCCCTCGGCGGC 6114 0 
LTDVQRAYYVGREGGFALGG 

61141 GTCTCCACCCACGCCTACCTGGAGATCGAGGCCCCGCGGATCGACGTCGCACGGTTTACC 612 0 0 
VSTHAYLE I EAPRIDVARFT 

61201 GG CG CG CTG CG CGGGGTGATCG C C CGG C AC G C C ATGCTG CGCG C CG TG AT C CGT C C CG AC 61260 
GALRGVIARHPMLRAVIRPD 

61261 GGGCTCCAGCAGGTGCTCACCGACGTCCCCCCGTACGACGTGGCCGTGCACGACCTGCGC 6132 0 
GLQQVLT DVPPYDVAVHDLR 

61321 GACCTGGACGAGCCCGCGCGGCAGCGCCGACGCGCCGCGCTGCGCGAGGAGATGTCCCAC 61380 
DLDEPARQRRRAALREEMSH 

613 81 CAGGTGGTGCCCGCCGACCTCTGGCCCCTGTTCGACGTCCGCGTCTCCCTCGGCCCCACG 6144 0 
QVVPADLW P LFDVRVS LG PT 

61441 GACGCCCTCGTCCACGTGGGGGTGGACGCGCTGATCTGCGACGCCCACAGCTTCGGCCTC 61500 
DALVHVGVDALICDAHS FGL 

61501 GTCCTGGCCGAACTCGCGGCCCGTTACGCCGACCCCGCACGCCGCTTCCCGCCCCTGACG 61560 
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VLAEL.AARYADPARRFPPLT 



61561 GCGGACTTCCGGGACCACGTCCTCCATCAGGAGGCGCTCCGCGGAACCGCCGAGTACGCG 
ADFRDHVLHQEALRGTAEYA 

61621 GCGGCGGAGCGGTACTGGCGCGAACGCCTGCCCGAGCTGCCGCCCGGCCCCGAACTGCCC 
AAERYWRERLPELPPGPELP 

61681 CTGGCCGTCGCGCCCGAGACCCTCGGCACCCCGCGCTTCACCCGCCGCTCCGGCCGGCTG 
LAVAPETLGTPRFTRRSGRL 

61741 GACGCGGCCTCCTGGACGGCGGTCAAGGACCGGGCCCGCCGCGCCGGGCTCAGCCCCTCC 
DAASWTAVKDRARRAGLSPS 

61801 GGCGTACTGCTGGCGGCGTTCGCCGAGGTGATCACCGCGTGGAGCGGCCGGCCGCGCTAC 
GVLLAAFAEVITAWSGRPRY 

61861 TCGCTGATGCTGACGGTCTTCGACCGCCCGCCGCTCCACCCGGACCTCGGGCGGATCGTC 
SLMLTVFDRPPLHPDLGR IV 

61921 GGCGACTTCACCTCGCTCAGCCTGCTGGAGGTCGACCACAGTCGGCCCGGCGACTTCACC 
GDFTSLSLLEVDHSRPGDFT 

61981 GACAGGGCCCGCGCCCTCCAGCGCCGCCTGTGGCAGGACCTCGACCACCTGGCGGTCGGC 
DRARALQRRLWQDLDHLAVG 

62041 GGCGTGACGGTGACACGGGAACGGGCGCTGCGCCACGACGCCCGACCCGGTCTGCTCACA 
GVTVTRERALRHDARPGLLT 

62101 CCCGTCGTCTTCACCTCCGACCTGCCTGTCGGCGAGACCGCGGCCGAGGACGCGGACGGG 
PVVFTSDL PVGETAAEDADG 

62161 GGAGAGGGATGGGCGCTCGGAGAGCCCGTCTACGGCGTCAGCCAGACCCCGCAGGTCCAT 
GEGWALGE PVYGVSQTPQVH 

62221 CTCGACCATCAAGTCGCCGAAGACCGAGGGGAGTTGGTCTTCAACTGGGACGCCGTGGAA 
LDHQVAEDRGELVFNWDAVE 

622 81 GACCTGTTCGCCCCGGGCGCCCTGGACGCCATGTTCGCCGCCTACACCGCCTCGCTGACC 
DLFAPGALDAMFAAYTASLT 

62341 CGCCTGGCCCGGAGCCCCGAAGCCTGGCGGCGGCCCGGCACGCCGCCGCTGCCCACCGCC 
RLARS PEAWRRPGTPPLPTA 

624 01 CAGGCGGCCGTGCGCCGGCGCACCGCCGCGACCGAGGCGCCCCTGCCCGCCCGCCTGCTG 
QAAVRRRTAATE APL PARL L 

62461 CACGAGGCCGTCGGCGACGCGGCCCGGCGCCACGCCGACCTGACCGCCCTGGTCGACGGC 
HEAVGDAARRHADLTALVDG 

62521 GACACCCGGATGACCTACCGGCGACTGACCGAGCACGCCCGGCGCGTCGGCCGCACGCTG 
DTRMTYRRLTEHARRVGRTL 

62581 CGCCGCCTCGGCGCCCGCCCCGGCCGCCTGGTCCCGGTGGTCGCCCGCAAGGGGTGGCGG 
RRLGARPGRLVPVVARKGWR 

62641 CAGGCCGTCGCCGCGCTGGGCGTCCTGGAGTCGGGGGCGGCGTACCTGCCCCTGGACCCC 
QAVAALGVLESGAAYLPLDP 

62701 GAACTGCCCGCCGAACGGCTCGTCCACCTCGTACGGCGCGCCGAAGCCGCCCTCCTCCTC 
ELPAERLVHLVRRAEAALLL 

62761 ACCGAACGCGCCCTGCTGGACACGCTCGCCGTCCCCGTCGGCGTCACCGTGCTCGCGGTG 
TERALLDTLAVPVGVTVLAV 

62 821 GACGACGACGCGGCCCTCGACGCCGACGGCGGCCCGCTGCAGAGCGTGCAGAACCTCACC 
DDDAALDADGGPLQSVQNLT 

62881 GACCTGGCGTACACCATCTTCACCTCGGGCTCCACCGGCG7VACCCAAGGGCGTCATGATC 
DLAYTI FTSGSTGEPKGVMI 

62941 GACCACCTCGGCGCGGCCAACACCCTGGAATGCGTCAACCGCCGCTTCGGCACCGGCCCC 
DHLGAANTLECVNRRFGTGP 
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63001 GGCGACGCGGTCCTCGCCGTCTCCTCCCCGAGCTTCGACCTCGCCGTCTACGACCTGTTC 63 060 
GDAVLAVS S PS FDLAVYDLF 

63061 GGCGTGCTGGCCGCCGGCGGCACCGTGGTCGTCCCCGCCCACGACCGCCGGCGCGACCCC 63120 
GVLAAGG TVVVPAHDRRRDP 

63121 GGACACTGGGCCGAGCTGATCCGGCGCGAGCGGGTCACCCTGTGGAACTCCGTCCCCGCG 63180 
GHWAELI RRERVTLWNSVPA 

63181 CTGGGCACCCTGCTCACCGAGTACGCCGAGGCCCTCGCCCCCGACGCCCTGCGCACCCTG 6324 0 
LGTLLTEYAEALAPDALRTL 

63 241 CGGGCGGTGCTCCTCAGCGGCGACTGGATCCccct cggactgcccgaccGGATCCGCGCC 63300 
RAVLLSGDWI PLGLPDRIRA 

63 301 CTGTCCGCCCCCGGCGCCACCGTGATGAGCCTCGGCGGCGCGACCGAAGCCTCCATCTGG 63 360 
LSAPGATVMSLGGATEAS IW 

633 61 TCGGTCTGGTACGAGATCGGGAAGGTGCACGAGGCGTGGAGCAGCATCCCCTACGGCACC 63 42 0 
SVWYEIGKVHEAWSSIPYGT 

63421 CCCATGGCCAACCAGCGGCTGGAGGTCCTCGACGAGCAGCTGCGGCCCCGGCCCGACTGG 63 4 80 
PMANQRLEVLDEQLRPRPDW 

6 34 81 GTGCC CGGCGAGCTGTACATCGGCGGCACCGGCGTCGCCAAGGGCT ACTGGCGCGACCCG 63 540 
VPGELYIGGTGVAKGYWRDP 

6 3541 GAACAGACCTCCCTGCGCTTCCCCGTCCACCCGGGCAGCGGGCAACGCCTGTACCGCACC 63 60 0 
EQTSLRFPVHPGSGQRLYRT 

63 601 GGGGACTTCGCCCGCCACCTCCCCGACGGCACGCTGGAATTCCTGGGCCGGCAGGACGAC 63 660 
GDFARHLPDGTLEFLGRQDD 

63 661 CAGGTGAAGATCGGCGGATTCCGGGTCGAACTGGGCGAGGTCGAGGCGGCCCTCGGCCGA 63 720 
QVKIGGFRVELGEVEAALGR 

63721 CTGCCCGACGTCGCCGCCGGCGCGGTGATCGCCACCGGTGACCCGCGGGGCGACCGCCGC 63780 
LPDVAAGAV IATGDPRGDRR 

6 3781 CTCGTCGGCTTCGCCGTACCGGCCCGGGAGGGCGGCTTCGACGCGGCCGGGCTCCGACGG 63 84 0 
LVGFAVPAREGGFDAAGLRR 

6 3 841 CAACT CG C C CGG CGG C TG C C CGC CT ACATGGT C C C CACGAC C C TG CTG CC C C TGG AC CGG 63 900 
QL.ARRLPAYMVPTTLLPLDR 

63 901 CTGCCGCTGACCGCCAACGGCAAGGTCGACCGGGCCGCACTCCAACGCCTCGTCCCCGGC 63 960 

LPLTANGKVDRAALQRLVPG 

6 3 961 CGCGCACCGGCCCCGGCGGAACCCGCCACCGCCCCACCTGCCCGTTCCCGCGCCGTCCCC 64 020 
RAPAPAE PATAPPARSRAVP 

64 021 GTGCCCGGCTGGCTCGCCGACCTGTGGTGCGAACTCCTCGACGTGCCGGAGGCCGACCCC 64 08 0 

VPGWLADLWCELLDVPEADP 

64 081 GACGCGAACTTCTTCGCCCTCGGCGGCACCTCCCGGGTCGCGATCACCCTGGTCACCCGG 6414 0 
DANFFALGGTSRVAITLVTR 

64141 ATCGAGGCCCGACTCGCCGTCCGGGTGCCCCTCGCCCGCCTCTTCGACGCCCGCACCCTG 64200 
I EARLAVRVPLARLFDARTL 

64201 GGCGGCCTCGCCGAGACGATCGCCGAACTGTCGGCCGCCGCCGAGGAGGAGCCGGCACCC 64 260 
GGLAETIAELSAAAEEEPAP 

64261 GCCGAGCCCGTGTACGCCCCCGACCCCGCCACCCGCCACGAGCCGTTCCCGCTCACCGAC 64 320 
AEPVYAPDPATRHEPFPLTD 

64321 ATCCAGCGCGCCTACTGGCTCGGCCGGCACCGCTCCCTCTCCCTTGGCGGCGTCGCCACG 64380 
IQRAYWLGRHRSLSLGGVAT 

64 3 81 CAC AC CT AC CT CG AACT CGACGT CG AGGAC CTCGAC C C CGG C CGG CT C C AGACGG C C CT C 6444 0 
HTYLELDVEDLDPGRLQTAL 

64441 CGCCGGCTGATCGACCGCCACGACGCCCTCCGGCTCGTGGTCCTCCCCGACGGCCGGCAA 64 500 
RRL I DRHDALRLVVLPDGRQ 
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64501 CAGATCCTCGGCGACGTACCGCCGTACCTCCTCGCCCACACCGACCTGCGGGGCAGGGCG 64 56 0 
QILGDVPPYLLAHTDLRGRA 

64 561 GACGCCGAGGCCGAACTGGCCCGCGTCCGCGAGCACATGTCGCACGAGGTGCGCGACGCC 64 62 0 
DAEAELARVREHMSHEVRDA 

64 621 TCCCGCTGGCCGCTGTTCGACGTACGGACCCACCGCCTGGACGACGTCCGCACCCGGCTG 64 680 
SRWPLFDVRTHRLDDVRTRL 

64 681 CACCTGAGCTTGGACCTGCTCATCGCCGACGCCCACAGCGTCCACGTACTCACCGGCGAC 64 74 0 
HLSLDLLIADAHSVHVLTGD 

64 741 CTGCTCACCTTCTACGCCGACCCCGACGCGGCCCTGCCGCCCCTCGGCTGCTCCTTCCGC 64800 
LLTFYADPDAALPPLGCSFR 

64 8 01 GACTACGTCCTGGCCGTCCGCGCCCACGCCGAGGGCGAGCCGCGCCGCCGCGCCCTCGAC 64860 
DYVLAVRAHAEGEPRRRALD 

64 861 CACTGGCGGGCCCGGCTGGCCGACCTGCCGGGCCCGCCCGGCCTGCCGCTGCGGTGCCGG 64 92 0 
HWRARLADLPGPPGLPLRCR 

64921 CCCGAGGAGCTGACCGCGCGGCGGTTCGCCCGCCTCACCACCGGACTCGGCCCCGACGCC 64 98 0 
PEELTAPRFARLiTTGLGPDA 

64 981 TGGGCACGGCTGCGGCGCGCCGCGGCGGCCGCCGAACTCACCCCGGCCGCACTGATCTGC 6504 0 
WARLRRAAAAAELTPAALI C 

h*£ ...... 

=.p 65041 GCCGCCTTCTGCGACGTCCTCGCCCAGTGGAGCGACACCCCCCGCTTCACCCTCAACCTC 65100 

AAFCDVLAQWSDTPRFTLNL 

* -^1 65101 ACCACCTTCCACCGCCCCGCCCTGCTCCCCGGCGTGGACGACCTCGTCGGCGACTTCACC 65160 

'"■•J TTFHRPALLPGVDDLVGDFT 

. .7* ...... 

65161 ACCACGACCCTGCTCGGGGTCGACGGCGAGGGGGACACCTTCCGGGACCGGGCCCGCCGA 6522 0 
Q s TTTLLGVDGEGDTFRDRARR 

[, ~ 65221 CTCCAGGACCGCATCTGGGAGGACCTCGAACACCGCGTCGTCAGCGGCGTCGAGGTCCTG 6 52 8 0 

^ LQDRIWEDLEHRVVSGVEVL 

...... 

652 81 CGGATGCTGCGCCGCGAGCGGGGCACCCACGACGCCGTCCGGATGCCGGTCGTCTTCACC 6534 0 
J**j RMLRRERGTHDAVRMPVVFT 

Ml 65341 AGCACCCTGCGGGCCGCCGGCCCCGCCCCCCGGACGGCCCCGCCCGCCTGGCGGGTACGG 65400 

fj STLRAAGPAPRTAPPAWRVR 
t-^ ...... 

w 654 01 CCCGGCTACGCGATCAGCCAGACCCCGCAGGTCCTGCTCGACCATCAGGTGAGCGAGAGC 6546 0 

PGYAI SQTPQVLLDHQVSES 

65461 GACGGCCGACTGGTCTGCACCTGGGACTACGTCGCGGACGCCTACCCGCCCGGGCTGATC 6552 0 
DGRLVCTWDYVADAYPPGLI 

65521 GAGGCCATGTTCGGGGCCTTCGAGGCGCTCCTCGCCTCGCTCGCCGGTCACGACGACGAC 6558 0 
EAMFGAFEALLASLAGHDDD 

65581 GCCGGCCACGACGACGACGCCGGCCACGACGACGGCCCCGGCCACGACGACGGCCCCGGC 6564 0 
AGHDDDAGHDDGPGHDDGPG 

65641 CACGACGACGGCCCCGGCCACGACGACGGCCCCGGCCACGACGACGGCCCCGGCCGCGAC 65700 
HDDGPGHDDGPGHDDGPGRD 

657 01 GACAGTGCCGATCACGGCCACAGTGCCACGCACGACGACAGCGCCGCCCGAAACGACAGA 6576 0 
DSADHGHSATHDDSAARNDR 

657 61 GAGGGAGGTGGACCGGAGTGACGAGCGCCCGGCCCACGCCGACACTGCTCCCCGCCGACC 6582 0 
E G G G P E * 

MTSARPTPTLLPADQ (orfl6) 

65821 AGCGGGAGCTGCTGCGGATGATGAACGACCGCACCGCACCCGTGCCCGCGCACACCCTCA 65880 
RELLRMMNDRTAPVPAHTLT 

65881 CCGCCCAACTGGCCGACGCCGCGCGCACGCACGACCGGGCTCTGGCACTGGTGGCACCGG 6594 0 
AQLADAARTHDRALALVAPG 
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65941 GTCTG ACACTGAGCCACGCCGAACTGGACGCCCGGGCGGCCGCGGTGGCCGCCCGGCTCA 66 000 
LTLSHAELDARAAAVAARLT 

66001 CCGCCGCGGGCGTCATCCCCGGGGACCGGGTCGCCCTCGCCGTCGAGTACGGCTGGGAGC 66 060 
A A G V I PGDRVALAVEYGWEQ 

6 6061 AGGTCGTGGGCGCCCTGGCCGCGCTCCGCGCCGGAGCCGTCTGCCTGCCCGTCGCCCCCG " 66120 
VVGALAALRAGAVCLPVAPG 

66121 GGCTGCCCCGGCCCGCCCGCTGGCAGCACGCCACCCGGGCCGGGGCGACGGCCGTCCTCA 66180 
LPRPARWQHATRAGATAVLiT 

66181 CCCAGTCCTGGCTCACCCAGCGCATCGACTGGCCGCAGGAACTGCCCGTCCTCTCCGTGG 6624 0 
QSWLTQRIDWPQELPVLSVD 

66241 ACGAACCCGGGCCGCCGGTACCACCCACCACCGCC CCGGCCGACGG ACGGTCCGCGACCG 663 00 
EPGPPVPPTTAPADGRSATD 

663 01 ACGCCGCCTACCGGCTGGACGCCCCCGTCAGCCACCGCGCGATCACCACCGCCGCCCTGG 663 60 
AAYRLDAPVSHRAI TTAALE 

6 6361 AGATCGACCGCGCCTTCCGCGTCGGACCCGGCGACCGGCTCCTGGCCCTGGCCCCCGCCG 66420 
I DRAFRVGPGDRLLALAPAD 

6 6421 ACTCGCCGCTCGCTCTCTACGAACTGTTCGGGCCCCTCCTGGCCGGTGCGGCCCTCGTCC 664 80 
SPLALYELFGPLLAGAALVL 

Q 66481 TCACCCGGGACATCGACCTGCGCGATCCCGGAGCCCTGCACGAGGCGCTGCGCACCCACG 66540 



TRDIDLRDPGALHEALRTHG 

66541 GCGTCACCCTCTGGCACTCGCCGCCCGCCCTCCTCGGCCTCCTCCTCGACCACCTCGCCG 66600 
VTLWHSPPALLGLLLDHLAD 

6 6601 ACCGGGGCGGCAAACTGCCCGAGTCGCTCCGGCTGGTGCTGCTCGGCGGCGAACGCCTCG 66660 
RGGKLPESLRLVLLGGERLD 

66661 ACCCCGCCCTCGTCCGCCGCGTCCGCGAGAGCGCCCCGCACCAGCCGGCCGTCGCCCACC 66 72 0 
PALVRRVRESAPHQPAVAHL 

66721 TCTCCTCGGCCACCCCGTCCGGCCCCTGGACCACCTGCCTGGAGACCGGCGACCTCGCCC 667 8 0 
SSATPSGPWTTCLETGDLAP 

66781 CGGAATGGCGCTCGGTCCCCGTCGGCGCGCCCCTGCCCAACCAGCGGGCGCACATCCTGT 66 84 0 
EWRSVPVGAPLPNQRAHILS 

66841 CCGAGACCCTGCGGCCCTGCCCGGTCTGGGTCACCGGCCGCCTCCACTACGGCGGCGTCG 66 900 
ETLRPCPVWVTGRLHYGGVA 

66901 CCGCCGAGCCCCCCACCGGAGAGGAGCACGCACCCGCGACCGTCCCGCACCCGGAGACCG 66 96 0 
AEPPTGEEHAPATVPHPETG 

66961 GCGAACCGCTGCTGCGCACCGGGCTGTTCGCCCGCCTGCTGCCCGAGGGCCTGATCGACG 67 02 0 
EPLLRTGLFARLLPEGLIDV 

6 7021 TCGTCGGCGACGAGACCGCCCGGATCAGCGTCCGCGACCGGCCCCTGAACCTCCAGGACA 67 080 
VGDETARI SVRDRPLNLQDT 

67081 CCGAGACCGCCCTCGCCGCCCACGAGGACGTGCACTCCGCCGTGGTCGTCCCCGTCGGGC 6714 0 
ETALAAHEDVHSAVVVPVGR 

6 7141 GGGGAGACGAGTCGCTCGCGCGGGTACGGCTCCACCCCGGCGCCACGGCCGGCCCCGACG 672 00 
GDES LARVRLH PGATAG PDE 

67201 AACTCCTCGCCCATCTGCGCCGCAAGGTCTCCCCTTACCTGCTGCCCGGCCACATCGAGG 672 60 
LLAHLRRKVSPYLLPGHIEV 

67261 TGGG CGGT C CG CTG C CG CTC AC C CGGG ACGGG CGCGTGG AC CG CG CG CG CGT C AC CG C CG 67 320 
GGPLPLTRDGRVDRAR VTAE 

67321 AGGCCCCCGCCCCCGCTGCCGTGCCCGCCGCCGCGCCGGCGGCGTCGGCACCCGCGCGGG 67 38 0 
APAPAAVPAAAPAASAPARD 

673 81 ACG AGGC CG AACT CCTCG C C CAAGTGGC C CGGGTGAC CTG CCGGGTG CTGGG AAT CGG CG 6744 0 
EAELLAQVARVTCRVLG I GA 
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67441 C CGT CG AAC C CG AT ATGAAC CTGC T CGACGCCGGTG C CAC CTC CGT CG AACT CGT C CG C C 67 500 
VEPDMNLLDAGATSVELVRL 

67501 TGGCGACCGCTCTGGAGGAGGAACTCGGCCTCGACACCGACATCGAGGAACTGCTGGCCT 67560 
ATALEEELGLDTDIEELLAF 

67561 TCCCGTCGGTCGCCGTGATCGTCGGCCGCCACCTCGGCCGCCGGACGGCACCACCGGCCC 67620 
PSVAVIVGRHLGRRTAPPAR 

67621 GGGACCCCCTGCCGCCCGCGTCCGTAGCGTTCGCACCCGGGTCCGTACTGCCCGCGCCGC 67 680 
DPLPPASVAFAP GSVLPAPP 

67681 CCGCGCCCGGACCCGTGCCGCCCGCGTCCGTGCCGCCCGCACCCGCGTCCGTACCGCCCG 6774 0 
APGPVPPASVPPAPASVPPA 

67741 CGTCCGAGTCCTCACCGCTCGCGCCGCCCGCACCCGGGCCCGTGCCACCCACGCCCGTCC 67800 
SESSPLAPPAPGPVPPTPVP 

67 801 CGCCCGCCTCCGTCCCGCCCGCGTCCGGGGCCGCGCCGCACGTACCGCCCGCGCCGCCCG 6786 0 
PASVPPASGAAPHVPPAPPA 

67861 CACCCATCCCCGCGCCCTCCGTGCCccccgcgccccgcccccaaccgcccctgctcaccg 6792 0 
PIPAPSVPPAPRPQPPLLTG 

67921 gcatcggcgcccgccaggcgTTCAAGGACGCCCACCACGGCATCCGGCACGAGTTCGACG 6798 0 
IGARQAFKDAHHGIRHEFDA 

67981 CCACCGACGGCGTCGCCCTCAGCGGCCCGGACGACCACCACCTCACCGCCCGTCGCAGCC 68 04 0 
TDGVALSGPDDHHLTARRSH 

68041 ACCACCGCTTCGACCCCGGCCCCGTGACGCTGCCGGACCTGGCCGCCCTCCTCGGGGCCC 68100 
HRFDPGPVTLPDLAALLGAL 

68101 TCCGCCGGGTCCGCGGCCCGGGAGGCGAACCCAAATACGCCTATCCGTCGGCCGGTTCCT 68160 
RRVRGPGGEPKYAYPSAGSS 

68161 CCTACCCCGTCCAGACCTACCTGCTCGTCCACCCGGGGAAGGTGACCGGACTGCCCGGCG 6 8220 
YPVQTYLLVHPGKVTGLPGG 

68221 GCAGCCACTACGTCCACCCCGCGCGCAACCGCCTGGTGAGCATCGACCCCACCGCGACCC 6 8280 
SHYVHPARNRLVS IDPTATL 

682 81 TGCCCGCCGACGCGCACGCCGAGATCAACCGCGCCGCCTACGGGGAGGCGGCCTTCTCCC 6834 0 
PADAHAE I NRAAYGEAAFS L 

68341 TCTACCTCATCGCCGCGATCGACGCGATCACACCGCTCTACGGCGATCTCTCCTGGGACT 684 00 
YLIAAIDAITPLYGDLSWDF 

684 01 TCACCGTCTTCGAGGCCGGTGCCATGACCCAGTTGCTGATGCGGACCGCCGTCGGCACCG 684 60 
TVFEAGAMTQLLMRTAVGTG 

684 61 GCATCGGCCTGTGCCCCGTCGGCACGATGGACCCCGCGCCGCTGCGCCGCGCGTTCGCCC 6852 0 
IGLCPVGTMDPAPLRRAFAL 

68521 TCACCGACCGGCACCGCTTCGTCCACGCCCTCCTCGGCGGGCGGCCCCGCACGGAGGCCC 6858 0 
TDRHRFVHALLGGRPRTEAP 

68581 CGTGAACCGGCACGGCCCCCTGGCGGGCCGGCGGCAGAGCGTCGACACCCGCAGCGCCGC 68640 

MNRHG PLAGRRQSVDTRSAA (orflS) 
* 

68641 GTGGGTGGCGCCGACGGGCACCCCGGGGCTGCCGCTGGAGGTGGCCGCCACCCGGGACGG 687 00 
WVAPTGTPGLPLEVAATRDG 

68701 CGTCGACCCGGCCGAATGGGCCCGCACCCACCTCGACACCGTCACCGGCTGGCTGCACCG 687 6 0 
VDPAEWARTHLDTVTGWLHR 

68761 TCACGGAGCCGTCCTGTTCCGCGGCTTCGGCGTCGGCCTCGACGGCTTCGGCGACGTCGT 6882 0 
HGAVLFRGFGVGLDGFGDVV 

68821 CCACGCCCTGGCCGGATCCCCCGAGGCGTACGTCGAACGGTCGTCGCCGCGCACCGCCCT 68880 
HALAGSPEAYVERSSPRTAL 
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68881 CGGGCATCACCTCTACACCGCCACCGACCACCCCGCCGACCAGCCCATCCCCCCGCACAA 6894 0 
GHHLYTATDHPADQPIPPHN 

68941 CGAGAACTCCTACCAACTCCGCTTCCCCGGACGGCTGGTCTTCGGCTGCCTCACCCCGGC 6900 0 
ENSYQLRFPGRLVFGCLTP .A 

69001 CCGGACCGGCGGCGCGACCCCGCTCGCCGACACCCGGCGCGTCCTGGGCCGCCTCGACCC 6906 0 
RTGGATPLADTRRVLGRLDP 

69061 CGCCCTCGTCGCCGCCTTCGCCCGCCGCGGGGTGCTCTACCAGCGCAACTACGGCGACGG 6912 0 
ALVAAFARRGVLYQRNYGDG 

69121 GATCGGCATGTCCTGGCAGGACGCCTTCCAGACCCGCGACAAGGCGGCCGTCACCGCCTA 6918 0 
IGMSWQDAFQTRDKAAVTAY 

69181 CTGCGCCGCCCGCCGCGTCGACGTCGAATGGAAACCCGACGGCGGGCTGCGGACCACCCA 6 924 0 
CAARRVDVEWKPDGGLRTTQ 

69241 GGTCCGCCCCGCCCTCGCCGTCCACCCGGCGACGGGGGAGCGGGTGTGGTTCAACCACGC 6 93 0 0 
VRPALAVHPATGERVWFNHA 

693 01 CGCGTTCTTCCACGTCTCCGCCCGGCCGCCCGCGCTGCGGGACGCCCTGCTGGCCCAGTT 6 9360 
AFFHVSARPPALRDALLAQF 

69361 CGACGAACGCGACCTGCCGAGCCACTCCTGCTACGGCGACGGCCGGCCCATCGAACCCGC 69420 
DERDLPSHSCYGDGRPIEPA 

6 9421 CGTCATGGAGGAACTGCACCACGCCTACGCCGCCGAACTGGTGGCGCCCGCCTGGCGGGC 6 9480 
VMEELHHAYAAELVAPAWRA 

6 9481 CGGCGACGTCCTCCTCGTCGACAACCTCCTCACCGCGCACGGCAGGGAACCCTTCACCGG 6954 0 
GDVLLVDNLLTAHGREPFTG 

69541 CGAACGCCGCGTCGTCGTCGGCATGGCACAGCCGCTGGACTGGGACGAGGTGAGCGCGTG 6960 0 
ERRVVVGMAQPLDWDEVSA * 

M (orfl4) 

6 96 01 ACCGCCCCCGGCACACCGCTGCCCGCGACCTTCGTCCAGCGCGGCCTGTGGCCGTCCACT 6 9660 
TAPGTPLPATFVQRGLWPST 

69661 CGCCACGCCCGCCCGGCGGAGGTCACCCACGTCCGCGCCCTGCGCCTGACCGGGGACACC 6 972 0 
RHARPAEVTHVRALRLTGDT 

69721 GACACGGCGCGGCTCACCGAGGCCGTCCGGCGGGTCACCGCCGCCCTCCCCGCCCTCACC 6 9780 
DTARLTEAVRRVTAALPALT 

6 9781 GCCGAACTCTCCGGCGACGAGGAACCCCGCCTGACCCTCCGGCCGGACGCCCCCGAGGTC 6 984 0 
AELSGDEE PRLTLRPDAPEV 

69841 ACCCCGGTCGACCTGCGCGGAGCCCCGTCCGCCGGACGCGACGCCGTCTGCGTGGCGCTG 6 9900 
TPVDLRGAP SAGRDAVCVAL 

6 9901 CTGCGCGCCGACCGGGACCACCCTCGCGCCGGACGCCACCGGGCCCGCTTCCACCTGGTG 6996 0 
LRADRDHPRAGRHRARFHLV 

6 9961 CGGCTCC7VCGACGACGAGACGGTGCTCGCGCTCACGGCCCACACCCTCCTCCTCGACACA 7 0 02 0 

RLHDDETVLALTAHTLLLDT 

7 0021 CCGTCTCTCTACGCCGTGCTCGGCGCGGTCTGCCAGGCGTACGCCGGCCGCTTCCGCCCC 7008 0 

PS LYAVLGAVCQAYAGR FR P 

70081 GAGCACTACCGCGACGCCACCACCCTGCCCGACGCGCCCCACGCCCCCCTCTCCGGTCGG 70140 
EHYRDATTLPDAPHAPLSGR 

70141 GCCCGGGCCTCCCGCCGGCGCTGGTGGCACCGGCGCCTGGCCGCCCTGCCCGGCCCGGCC 702 00 
ARASRRRWWHRRLAALPGPA 

702 01 CCGGCCCCCGCCGGCCCGCCCCGCGACCGGGTGACCGAAACCCACCGGCTGCGCATCCCC 702 60 
PAPAGPPRDRVTETHRLRIP 

702 61 GCAGCGCGCTGGAAAGCCCTGACCGCCCTGACCGCCCTGGGCGGCCCCCTCGGCGGCAAC 7032 0 
AARWKALTALTALGGPLGGN 

70321 GGCTCGCTCGCCGTCATGGCCCTGGCCGCCTGGTGCCTGCGCGCCCCGGACCACCGGGGA 703 8 0 



35 





GSLAVMALAAWCLRAPDHRG 

7 0381 CCGGCCCGCTTCACCACCGTCGTCGACCTGCGCGACCACCTCGGACTCGGGCCCGCCGTC 70440 
PARFTTVVDLRDHLGLGPAV 

70441 GGCCCGTTCACCGACCGCCTCGTCTTCGGCGCCGACCTCGGCGAAGCGCCGCGCCCCTCC 705 00 
GPFTDRLVFGADLGEAPRPS 

7 0501 TTCCGGGACGTCACGCTGCGCGCCCAGTCCGGGTTCCTGGACGCCGTCGTGCACTACCTC 70560 
FRDVTLRAQSGFLDAVVHYL, 

7 0561 CCCTACGGCGACGTCGTGGAACTCgGCAGGGAACTGGGCCGCGTCACCGCGCCCCGCACC 7062 0 
PYGDVVEL G RELGRVTAPRT 

7 0621 GCCGCGCACTGGGACGTGGCGCTGAACTTCTGCCGCAACCCGCCCACCAGCGCCGCCACC 706 8 0 
AAHWDVALNFCRNPPTSAAT 

70681 CGCGGCGAACGCACCCTCGCCGAACGCGGCCTGTCCATCGAGCTGTTCCGCGAGGCCGAC 7074 0 
RGERTLAERGLSIELFREAD 

70741 CTGCTCGGCGCGGCCGGCACCGGTCCCGCGCACCGGTGGGACGGCACGGTGCTCGCCCTC 708 00 
LLGAAGTGPAHRWDGTVLAL 

7 0801 TCCCTAGGCGAACTCGGCGACGACACCGTGCTGGTCCTCGACGCCGACCGCGACCACCCG 708 60 
SLGELGDDTVLVLDADRDHP 

7 0861 CACCACGGAACCGCCGACCGGCTGCTCCACCGGATGGACGAAGCGCTCCTGGCGGCCGTC 70 92 0 
HHGTADRLLHRMDEALLAAV 

S S 70921 GCCGACCCGGACGCCCCCCTGCCCCCCTTGCCCGCCCCCGCGCACACCACGAGGAGCCAC 70 980 

4" ADPDAPLPPLPAPAHTTRSH 

- _1 7 0981 CGATGACCACGACCCCGCGGACCGCCGCCGAGCCCACCTACCACGTGGTGGTCAACGACG 7104 0 

J MTTTPRTAAE PTYHVVVNDE (orfl3) 

r * 

i;n ...... 

r|j 71041 AGGAGCAGTACTCGATCTGGCTCGCCGAACAGGAGATCCCGGCCGGCTGGCGGGCCACCG 71100 

EQYS IWLAEQEI PAGWRATG 

Q 71101 GAACCTCCGGCACCCAGGAGGAGTGCCTGCGCCACATCGACGAGGTGTGGACCGACATGC 71160 

j*^ TSGTQEECLRHIDEVWTDMR 

j *3 71161 GCCCCCGCAGCCTGCGCGAGGCCATGGCCGCGGCGGAGCACGCGGAGCCCGCTCCCGCCC 712 2 0 

In PRSLREAMAAAEHAEPAPAP 

: 5 3B, ..... 

71221 CGGCCCCGGCCGAGGAGGAGCCGAGCCTCGTCGACCGGCTCTGCGCGGGCGACCAGCCGG 712 80 
APAEEEPSLVDRLCAGDQPV 

712 81 TGGAGTCGGTCCTCCGCCCGGAGCGCACGGCCGCCGCCCTGCGGGAGGCCGTCGACCGCG 7134 0 
ESVLRPERTAAALREAVDRG 

71341 GCTACGTCTTCGTCCGCTTCGCCGCCACCCGCGGCGGCACCGAACTCGGCGTCGCCGTCG 714 00 
YVFVRFAATRGGTELGVAVD 

714 01 ACCCCGCGGCGACCACCATGGACGGCACCGAGCTGCGCCTGACCGGCACCCTCACCCTCG 714 6 0 
PAATTMDGTELRLTGTLTLD 

71461 ACTTCGAACCGGTCCGCTGCCACGCCCGCGTCGACGTGACCACCTTCACGGGCGAGGGCC 7152 0 
FEPVRCHARVDVTTFTGEGR 

71521 GCCTGGAGCGCGTGTCCGGCACCTGACCCCCGCCGGCCACCCGGCCGTGAGGCGCGGCTC 71580 
LERVSGT* 

71581 GGGACCGGGCCGCCGACCCACCGAAGGGAGGGACCCCATGACCACCCCCATGACCACCCC 71640 

MTTPMTTP (orfl2) 

71641 CACGACCACCCGCACCACCACCCGCACCGCCGTCTTCGCCCACCTCCGCGCCCCCGGCCT 717 00 
TTTRTTTRTAVFAHLRAPGL 

71701 CGGCGACCTCCTCCAGCGCAACATCGGCCTCGCCCTCGTCCGCCGCGCCCGCCCGGCGAC 7176 0 
GDLLQRNIGLALVRRARPAT 

71761 GGCGGTCACCCTGGTCGTCGGCGAGGACCTGGCGGCCCGCTTCGGTCCGGCACTCACCCG 71820 
AVTLVVGEDLAARFGPALTR 
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71821 CCACACGTACGCCACCGACGTGCTGCCCTGCCCCCAGCGGGGCGAGGCCGACCCCCGGTG 7188 0 
HTYATDVLPC PQRGEADPRW 

71881 GCCCGCCTTCCTGCGCACCCTGGCCGACCGCCGCTTCGCCCTCGCCGTCGTCGACCCGGA 71940 
PAFLRTLADRRFALAVVDPD 

71941 CAGCCAGGGCCTGCACGCCGGCCACGCCCGGGCCGCCGGCGTGCCCGAGCGGATCGGCCT 72 000 
SQGLHAGHARAAGVPER IGL 

72 0 01 GCCGCAGGACCGGCCCGGAGACGAACACATCACCCATCCCATCCGCCTCCCACGTCCCCT 72 06 0 
PQDRPGDEHITHPIRLPRPL 

72 061 GTGGGGGACCCCGGACCTGTACGAGTACGCCACTGCCCTCGCCGCCGCGCTGGGCCTGCC 72120 
WGTPDLYEYATALAAALGLP 

72121 CGCACCGCCGCGCCCCGGCGACGTCCTGCCGGAGCTGCCCCGCACCCGCGGCGTCCGCCC 72180 
APP RPGDVLPELPRTRGVRP 

72181 GCCGACGGCCGGTCTGCCCCGTCCGCTCGTCGCCGTCCACCCCGGCGGGGCACCGCACTG 72 240 
PTAGLPRPLVAVHPGGAPHW 

72241 GAACAGGAGATGGCCGCTCGAGCACTACGCCCGGCTCTGCGCCCGCCTCGCGGCCGAACT 723 00 
NRRWPLEHYARLCARLAAEL 

723 01 CTCGGCCTCCCTCTGCCTGCTGGGCGACGAAGCCGAACGCCCCGAGCTGGAACTGCTCCG 72 3 60 

SASLCLLGDEAERPELELLR 

[ =:a i: ...... 

s i 723 61 GCACGCCGTCCTGACGCGGTCCCCGCGAGCCGTCGTCCACCTCGAGGCGGGCGCGGACCT 72420 

HAVLTRS PRAVVHLEAGADL 

724 21 CGACCGGACCGCGAACGTCCTCGCCGACGCCGACCTGCTCGTCGGCAACGACTCCTCGCT 724 8 0 
!% J DRTANVLADADLLVGNDSSL 

[ M 724 81 CGCCCACGTCGCCGCCGCCGTCCGCACCCCGTCCGTCGTCCTCTACGGCCCGACCGGCAC 7254 0 

?¥\ AHVAAAVRTPSVVLYGPTGT 
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72541 CGAGTACCTGTGGACCAGGATCTACCCGTACCACCGCGGGGTCTCCCTGCGGTGGCCGTG 72600 
EYLWTRI YPYHRGVSLRWPC 

72601 CCAGCGGCTGCGGCACGCCGCAGGCGAACTCGCCGGCCGGCGGTGCGCGCACGGCTGCGT 72660 
QRLRHAAGELAGRRCAHGCV 

726 61 CCTGCCCTACCAGGGCCCGGCCGGCCCGTATCCGCGCTGTCTGGCCGACCTGCCGGTGGA 7272 0 
LPYQGPAGPYPRCLADLPVD 

72721 CAGGGTCTGGCCGGCGGTGACCGCCCGATGGGCGAGCCCCCACCCCGTGACGATCAGGAG 72780 
RVWPAVTARWASPHPVTIRS 

72781 TACCCCATGAGCGCCGACCCGTCCCGGGTGCGGACGATCCTCTCCGTCAACTTCAACCAC 72 840 
TP* 

MSADPSRVRTI LSVNFNH (orfll) 

72 841 G ACGG CT C CGG CGTGCTGTTG CGGG AGGG C AGG AT CG C CGG CT ACGT CAC CAC CG AG CG C 72 90 0 
DGSGVLLREGRIAGYVTTER 

72 9 01 CGCTCCCGCCTCAAGAAGCACCCGGGCCTGCGCGAGGAGGACCTCGACGAACTGCTGGAC 72 9 60 

RSRLKKHPGLREEDLDELLD 

72961 CAGGCCGGGGCCGACCTCTCCGACATCGACCACGTCATGCTCTGCAACCTGCACACCATG 73 020 
QAGADLS DI DHVMLCNLHTM 

73 021 GACACACCCGACATACCCCGGCTGCACGGCTCCGACCTCAAGGAGACCTGGCTCGCGTTC 73 0 8 0 

DTPDI PRLHGSDLKETWLAF 

7 3 081 TGGGTCAACCAGCGCAACGACGAGGTGAGCCTGCGCGGCCGCCGCATCCCCTGCACCGTC 7314 0 
WVNQRNDEVSLRGRRI PCTV 

7 3141 AACCCGGACCACCACCTCATCCACGCCGCCACCGCCTACTACACCTCCGGCTACGACTCG 732 00 
NPDHHLI HAATAYYTSGYDS 

732 01 GCGATGGCCGTGGCCATCGACCCCACCGGCTGCCGCGCCTTCGCCGGCAAGGGCAGCCGC 732 60 
AMAVAIDPTGCRAFAGKGSR 
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73261 CTCTACCCCCTGCGCCGCGACCTCGACGCCTGGTTCAACGCCAACATCGGCTACTGCTAC 7332 0 
LYPLRRDLDAWFNANIGYCY 

73321 GTCGCCGACCTGATGTTCGGCTCCAGCATCGTCGGCGCCGGCAAGGTCATGGGCCTCGCC 733 80 
VADLMFGSS I VGAGKVMGLA 

733 81 CCCTACGGCAGACCCGCCGACGGCGCCGGCCCCGACGAGGAACCGCCCGAGACCGTGCGC 7344 0 
PYGRPADGAGPDEEPPETVR 

73441 GACTTCGCCGCCCTGGTGGCCCTGGCCGACCGGCACCCGCGCCTCGTCGACGTCGACGGC 73 50 0 
DFAALVALADRHPRLVDVDG 

73501 AGGAAGCTCAACGCCACCCTCGCCCACTACATCCAGCTGGGCCTGGAACGCCAGCTGACC 73 56 0 
RKLNATLAHY IQLGLERQLT 

7 3 561 GCCGTCTTCGCCGAGCTCGCCCCGCTGTGCGCCCGCAACGGCATCGCACCGGACATCTGC 73 62 0 
AVFAELAPLCARNGIAPD I C 

73621 CTCTCCGGCGGTACCGCCCTCAACGCCATCGCCACCCAACTCGCCTTCGAGTCGACCGGC 7 3 680 
LSGGTALNAIATQLAFESTG 

73681 TTCGAGCGCATGCACCTCCACCCCGCCTGCGGCGACGACGGCACCGCGATCGGCGCGGCG 7 3 74 0 
FERMHLHPACGDDGTAIGAA 

73 741 CTCTGGCACTGGCACCACGTCCTGGGCCACCCCCGGCTCCACCACACCAACGCCGACCTC 73 80 0 

LWHWHHVLGHPRLHHTNADL 

i;3 73 801 ATGTACTCCGTCCGTGAGTACCCCGAGCACACCGTCCGGCGGGCCGTGCGGGACCACGCG 7 3 860 
MYSVREYPEHTVRRAVRDHA 

£ 73 861 GCCGACCTCGTCGTCGAGGAGACCGGCGACTACGTCGCCAGGGCCGCCGAACTGGTCGCC 73 92 0 
"vJj ADLVVEETGDYVARAAELVA 

'2 73 921 GGCGGCGCCGTCATCGGCTGGTACGACGGCGCCGGCGAGGTCGGGCCGCGGGCCCTGGGC 73 98 0 
l £f GGAVI GWYDGAGEVGPRALG 

j^s ■ 73 981 CACCGCAGCATCGTCGCCGACCCGCGCGACCCCGCCATGCGGGACCGGCTCAACTCCCAG 74 04 0 
^ HRS IVADPRDPAMRDRLNSQ 

3 74 041 GTCAAGTTCCGCGAACACTTCCGGCCcTTCGCGCCGTCCGTGCTCAAGGAGCACGCCGCG 74100 
i'l VKFREHFRP FAPSVLKEHAA 

1 3 74101 GAGTGGTTCGGCCTCTCCGACAGCCCCTTCATGCTGCGGGCCACCCCCGTCCTCAAGCCC 7416 0 
|J| EWFGLSDSPFMLRATPVLKP 

]d[ 74161 GGCGTGCCCGCCATCACCCACGTCGACGGGACGTCGAGGATCCAGTCGGTCACCCGCCAG 74220 
U GVPAITHVDGTSRIQSVTRQ 

74221 GACACCCCCGCCTTCCACGACCTCATCCACGCCTTCAAGGACCGTACGGGGATCCC CATG 74 2 80 
DTPAFHDLI HAFKDRTGI PM 

74281 GTGCTCAACACCAGCCTCAACACCAAGGGCGAGCCGATCGCGGAGACACCCGAGGACGCC 7434 0 
VLNTSLNTKGEP IAETPEDA 

74 341 CTGCGCACCCTGCTCGGCTCCCGGCTCGACCACCTGGTGCTCCCGGGCCTCATCGTCAGC 744 00 

LRTLLGSRLDHLVLPGLIVS 

74 4 01 GGCCGGACGGCGGCCCGCTCATGAGCGCCCCGCGGGGCGAGCGGACCCGGCGCCGCGCGC 744 60 

MSAPRGERTRRRAL (orflO) 
GRTAARS * 

74461 TCGAACGCGACATCGCCGCGATCTGGGCCGAGACCCTCGGCAGGGACAGCGTCGGCCCGC 74 520 
ERDIAAIWAETLGRDSVGPH 

74 521 ACG AGG ACTT CGC CG CG CTGGG CGG C AAC T C CAT C CACG C C ATC AAGAT CACC AAC CGGG 7458 0 
EDFAALGGNS IHAIKITNRV 

74 581 TGGAGGAACTCGTCGACGCCGAGCTGTCCATCCGCGTCCTGCTCGAGACGCGCACCGTGG 74 64 0 
EELVDAELS IRVLLETRTVA 

74 641 CCGGCATGACGGACCACGTCCACGCCACGCTCACGGGGGAGCGGGACCGGTGAACACCGA 74 70 0 
GMTDHVHATLTGERDR* 

M N T D (orf9) 



38 




74 701 CCTGCCCCGGCTGCTCGACCGGATCGCCGGCCTGCGCGTCCTCGTCATCGGCGACGTCAT 74 760 
LPRLLDRIAGLRVLVIGDVI 

747 61 CCTCGACACCTACGTCTGGGGAGCCACCTCGGGCCTGTGCCGCGAATCCCCCGTCCCTGC 74 82 0 
LDTYVWGATSGLCRES PVPA 

74821 CGTCACCCTGACCTCCGTCGCCCACCAGTGCGGCGGCGCCGCCAACGTCGCCGTGAACCT 74 88 0 
VTLTSVAHQCGGAANVAVNL 

74 881 CCGGGCGCTCGGCGCCGAACCGGTGCTGCTCTCCGCGACGGGTGACGACCGCGCCGGCCG 74 94 0 
RALGAEPVLLSATGDDRAGR 

74 941 CCGGCTGCGCGAAGCCCTCCGTGCGCGGGACGTCGACACCGGCGGACTCTTCGTACAGCC 75000 
RLREALRARDVDTGGL FVQP 

75001 CGGCCGGACCACGGTCACCAAACGCCGCGTCATGGCCGACGGACAGATGCTGCTCCGCCT 7506 0 
GRTTVTKRRVMADGQMLLRL 

75061 CGACGAGGGCGGCGAACACCCGTTGCCCGTGGCGACGGACACCGGAAGCCGCCTGCTCGA 7512 0 
DEGGEHPLPVATDTGSRLLE 

7 5121 ACGGGCCGCCGGCCTGCTGCCCGCCGTCGACGCCGTGATCGTCTCCGACTACGGGTACGG 7518 0 
RAAGLLPAVDAVIVSDYGYG 

7 5181 CGTGTGGGAGCCCGACACCGTCGCCCGGCTCGCCGCACACCGCGAACTCGGCCCGTCCAC 7524 0 
VWEPDTVARLAAHRELGPST 

O 75241 CCTGGTCGTCGACTCCCGCCGGCCCGCGCGCTTCACCGCGCTGCGGGCCAGCGCCGTCAA 753 00 

LVVDSRRPARFTALRASAVK 

...... 

^ 753 01 ACCCAACCACGCGGAGGCGCTGCGCCTGCTCGACGCCGGCGAACCCCCGCCCGGCCCGGC 753 60 

H PNHAEALRLLDAGEPPPGPA 

: ;S 7 5361 CAGGGCGGACTGGGCGGCCGCCCTCGGCGACCGGCTCCTGCGCCTGACGGGAGCCGAACG 75420 

jiff RADWAAALGDRLLRLTGAER 

Til 75421 GGTCGCCCTCACCCTGGACGCCGACGGATCACTGCTCTTCGAACGCGACCGGCCCCCGGT 754 8 0 

VALTLDADGSLLFERDRPPV 

• xas. ...... 

J 75481 CCGCACGTTCGCCCGGGGCAGCCGGGCACCGGTCACGGCCGCCGTCGGCGCCGGCGACGC 75540 

1^ RTFARGSRAPVTAAVGAGDA 

! s ;f 7 5541 CTTCACCGCGGCCCTCACCCTCGCCCTCGCCGCCGGCGCCGACTCCGCGGTCGCCGCCGA 7 560 0 

U.I FTAALTLALAAGADSAVAAE 

j=E 75601 ACTGGCCTCCGCCGCCGCCGGCACGGCCGTCGCCACCCCCGGCACCAGCACCTGGCACGC 7566 0 

W ~ LASAA'AGTAVATPGTSTWHA 

75661 CGACGAACTGCGCCGACTGCTCGGCGGCACCGGCAAGGTCTGCCGGACCGGCACCCTGCC 7572 0 
DELRRLLGGTGKVCR. TGTLP 

75721 CGCCCGGCTGCTCGACCCGGCCGCCCGCGACCGCCGGGTCGTCTTCACCAACGGCTGCTT 7 5780 
ARLLDPAARDRRVVFTNGCF 

75781 CGACCTCCTGCACGGCGGCCACGTCTCCTGCCTGAGCCGGGCCAAGGAACTGGGCGACCT 7584 0 
DLLHGGHVSCLSRAKELGDL 

75841 GCTCGTCGTCGGCGTCAACTCCGACGCGAGCGTCCGACGCCTCAAGGGCCCCCGTCGCCC 75900 
LVVGVNSDASVRRLKG PRRP 

75901 GGTGATCCCCCTCGCCGAACGCATGCGCGTCCTCGCCGCCCTGAGCTGCGTGGACCTCGT 7596 0 
VI PLAERMRVLAALS CVDLV 

75961 CGTGCCCTTCGACGACGACAGCCCCGCCGCCCTCATCGAGGCCCTCCGCCCCGAGGTCTA 7 602 0 
VPFDDDSPAALI EALRPEVY 

76021 CGCCAAGGGCGGGGACTACACCCTCGCGACCCTGCCCGAAGCACCCCTCGTCCAACGGCT 7608 0 
AKGGDYTLATLPEAPLVQRL 

76081 CGGCGGCGTCGTCCACCTGCTCCCCAGCGTCGCCGACACCTCCACCACCGACATCATCCG 7614 0 
GGVVHLLPSVADTSTTD I IR 

76141 GCGCATCCACGCCCTGTCCAGGACCGGCGAGGGAGACACCCCATGAGCCACGCCATCGGA 762 00 

M S H A I G (orf8) 
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76201 CCGAGCCGGCTGATCCCCGCCATCCGCGAAGCGCTCGGGGACGAGAAGGACCCCCGGCTC 762 60 
PSRLI PAI REALGDEKDPRL 

762 61 GCCCTCTACGTCCACGTCCCCTTCTGCTCCTCCAAGTGCCACTTCTGCGACTGGGTCACC 7632 0 
ALYVHVPFCSSKCHFCDWVT 

76321 GACATCCCCGTCGCACGCCTGCGCGGCGACAGCCGGGAACGCTCGCCCTACGTCACCGCC 763 8 0 
DIPVARLRGDSRERSPYVTA 

76381 CTCTGCGACCAGATCCGCTTCTACGGCCCCCAGCTCACCCGGCTCGGCTACCGCCCCGAG 7644 0 
LCDQIRFYGPQLTRLGYRPE 

76441 GTCATGTACTGGGGCGGCGGCACCCCCACCCGGCTCACCGGCGACGAGATGACGGCCGTC 7650 0 
VMYWGGGTPTRLTGDEMTAV 

76501 CACCAGGCCCTCGACGACGCCTTCGACCTGACGGGACTCCGCCAGTGGTCGGTGGAGAGC 76560 
HQALDDAFDLTGLRQWSVES 

76561 ACCCCGAACGACCTCGACCCCGCCACCCTCGACACCCTGCGCGGCCTCGGCGTCACCCGC 7662 0 
TPNDLDPATLDTLRGLGVTR 

7 6621 GTCAGCGTCGGCGTCCAGTCGCTCAACCCGTACCAGCTGCGCAAGGCAGGCCGGGCCCAC 766 8 0 
VSVGVQSLNPYQLRKAGRAH 

76681 TCGCGCG AACAGGC CCTGGCCGCCGTCCCCCTGTTGCGCCGCGCCGGCATCGACGAGTTC 7 674 0 
SREQALAAVPLLRRAGIDEF 

76741 AACGTCGACCTGATCGCCGGCTTCCCCGGCGAAGCCGTCGAGTCCTTCGAGGAGACCCTG 76800 
NVDLIAGFPGEAVESFEETL 

76801 CGCACCGTCCTCGCGCTCGACCCGCCGCACGTCTCCGTCTACCCCTACCGCGCCACCCCC 76860 
RTVLALDPPHVSVYPYRATP 

76861 AAGACGGTCATGGCCATGCAGCTCGACCGCGAGTTCGTCGAGGCCCGGAACCGGGACGGC 76 92 0 
KTVMAMQLDREFVEARNRDG 

76921 . ATGATCGACGCCTATGAACGGGCCATGGCCGCGCTCGGCGCCGCCGGCTATCACGAGTAC 76 980 
M I DAYERAMAALGAAGYHEY 

76981 TGCCACGGCTACTGGGTGCGCGACGCGCGCCACGAGGACCAGGACGGCAACTACAAGTAC 7704 0 
CHGYWVRDARHEDQDGNYKY 

7 7 041 GACCTGGCCGGCGACAAGATCGGCTTTGGCAGCGGCGCCGAATCGATCATCGGTCACCAC 77100 
DLAGDKIGFGSGAES I IGHH 

77101 CTGCTCTGGAACGAGAACAGCGCCTACGCCCGCTACCTGCTCGCCCCCCGCGAGTTCTCC 7716 0 
LLWNENSAYARYLLAPREFS 

77161 GCCGCCCACCGGTTCACCACCGCCGAACCCGACCGCCTGACCGCCCCCGTCGGCGGCGCG 7722 0 
AAHRFTTAE PDRLTAPVGGA 

77221 CTGATGACCCGTGAAGGCGTGGTCTTCGCCCGCTTCCGCAGACTGACCGGCCTGGACTTC 772 8 0 
LMTREGVVFARFRRLTGLDF 

77281 GCGGACGTCCGCGCCACACCGTACTTCCGCCAGTGGTTCGAGCTCCTGGAGCGCTGCGGC 7734 0 
ADVRATPYFRQWFELLERCG 

77341 GGCCGCTTCGTCGAGACGCCGTACAGCCTCCGCCTGGAGCCGTCCACCATCCACCGCGCC 774 0 0 
GRFVETPYSLRLiEPSTI HRA 

774 01 TACATCACCCACCTCGCCTACACCATGGCCCATGGCCTGGCCCCCGAACGCGCCTGA 774 5 7 
YITHLAYTMAHGLAPERA* 

SEQ ID NO: 2 ORFS BLM gene cluster ORFs 31-40 

(notice this part is on the reverse strand and the last nucleotide (18660) is 
the first (1) on the whole cluster of 77457 bp. Also the last orf (40) is 
incomplete and contains frame shifts) 
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1 GTGACCGAGAACCTTCCGTCGTGCCCCGAATGCTCCAGCGCGTACACCTATGAGATGGGT 60 
MTENLPSCPECSSAYTYEMG 
(orf 31) 

6 1 GCGCTCCTGGTCTGCCCCGAATGCGGCCACGAGTGGCCGCCCGCGACCGCCGAGTCCGCG 120 
ALLVCPECGHEWPPAT AESA 

121 GACAACCCCGAAGACGGCGCGATCAGGGACGCGGTCGGCAACGTACTCGCCGACGGCGAC 180 
DNPEDGAI RDAVGNVLADGD 

181 ACCGTCACGGTGGTCAAGAGCCTGAAGGTCAAGGGCCACCCGACCGGCATCAAGGCCGGC 24 0 
TVTVVKSLKVKGHPTGI KAG 

241 ACCAAGGTGCGCAACATCCGCCTCGTGGAGGGTGTGGCCGGCCACGACATCGACTGCAAG 3 00 
TKVR NI RLVEGVAGHDI DCK 

3 01 ATCGACGGGTTCGGCGCCATGCAGCTCAAGTCCAGCGTGGTCAAGAAGGTCTGACCGGTT 3 6 0 

IDGFGAMQLKSSVVKKV* 

361 ACGCCGGCCCAGGCCCTGCCCAGGCTCCACTACGCCGCGGCGCAACCGAGCCGGAACGGG 4 20 

421 GCCCGGGCCCGCTCCAAGTCCCGTTCCGTGCGCGGCCGCGGCAGCCAGGCCGTGTTCACC 4 80 

4 81 CTGGGGTCGCCGTCCCCGTTCGCACGCGTCGTACACGCCACCACGCACGGCACGGAACTC 54 0 
54 1 CCCGAACTCGCCACGTTCCCCAAGTCCCCGCGTGCCCGGATCCGCCCGGACCGGCGTCGG 6 00 
601 TC CG C CCG C CGGGC CG CGG C CGGGTC CC CGGG C CG CGG CGGGAGGGGGTCT CGCG C CGTG 660 
661 GAACGCCGGCCGGAAATTTACGTATAGGTAGAGATCCCGGCGAAGCGATCGGCGCGTTAT 72 0 
721 GGCAGCATCCGCGCCGGCCCGCCGCGCAGTTCCTCGGTCCCGGACCGATGGCGTCAAAAG 780 
781 TGAGCGACGAAATCGCCGGATCGCGCGAGGACCGTCGCGGGCCGCACGAGGACAACCGGG 84 0 



901 CG CCTGTAGGG ATC ATGG CCGGGAATGGC CT C AG ACG CTTTG AGTG C C C ACCTTG AGGTT 960 

MASDALSAHLEV 
(orf 32) 

961 TCCGACTGTCGGCAGCGCGGGGGGATCACGGTGACGAATGACGGATCTGAACTCGCCGGG 102 0 
SDCRQRGGITVTNDGSELAG 

1021 CAAAACGTGGCGGCGGTCCGCTTCGAGCGGTATTCCGCGATCGCGCCGGAGCGGACCGCC 1080 
QNVAAVRFERYSAIAPERTA 

1081 ATCCTGCACAAAGGTGCCGCGACCGGTTACGACGAGCTCAACCGCCGGGCCGAGCTGACA 114 0 
I LHKGAATGYDELNRRAELT 

1141 GCCACGCGCCTGGCGGACGCGGGCGCCGGCCCCTCGACCCTGGTGGCAGTGGCCCTCCCA 12 00 
ATRLADAGAGPSTLVAVALP 

1201 CGCGATCCCGACCTCGTCGCCACCCTGTGCGCCCTGCTCAAACTGGGTGCCGCATGCCTT 12 60 
RDPDLVATLCALLKLGAACL 

1261 CCCCTGGATCCCGGCATACCGGCCGGGCGGCTGCGCGAGATCATGGCCGACGCGTCCCCC 132 0 
PLDPGI PAGRLREIMADASP 

1321 GACGTTCTCGTCACCACC CGTGCCGTCGCTCCGG CATTCACCGGTGACGGAC C CGT C CTT 13 8 0 
DVLVTTRAVAPAFTGDGPVL 
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1381 TTCCTGGACGACGCTCCTCCGACCTGCTCCGCCGTCCTTCCACGGCACTCAGCGGGGACC 144 0 
FLDDAPPTCSAVLPRHSAGT 



1441 GCGTCGGAAATCGCCTATGTGCTGTACCCGACGACTCCTGACGAGAAGTCCGAAAATTCG 1500 
ASEIAYVLYPTTPDEKSENS 



1501 GTCGTCTCCTATCGTGATATGGCGCGCTACCTTGACGACCCCACTGCCGGGATTCCGGCG 1560 
VVSYRDMARYLDDPTAGI PA 

1561 AGGGCGGAGATTCTCCGGCTGGTCGCGCCGCTCCTGTCCGGCGGTCGTCTGGTGCTGGAC 162 0 
RAEI LRLVAPLLSGGRLVLD 



1621 GCCGACGAGACCCGGCCCCGGCCGGTCACCCGTGAGGCGCCGCGCGACATGGTGGAGGAC 1680 
ADETRPRPVTREAPRDMVED 

1681 GTCGTGGCGCAGGTCTGGTGCGCCGTGCTCGGCGTGGACCGGGTGGGCGTGCGGGACCGC 174 0 
VVAQVWCAVLGVDRVGVRDR 

1741 TTCTTCGACCTGGGCGGCAAGTCGCTGGCGGCGGTCCAGGTGGTGGCGCGCCTGCGGAAG 1800 
FFDLGGKSLAAVQVVARLRK 

1801 CTGCTCGGCGTCGAGCTGCCGCTGCGGGCCCTGTTCGACGCGCCGACGGTCGAGGAGCTG 1860 
LLGVELPLRALFDAPTVEEL 



1861 GCCGCCCGGGTGCGGGCCGAACAGGCCGGCGGCCAGGGCGTCCGGGAGGAGGCGGCGCTC 192 0 
AARVRAEQAGGQGVREEAAL 



1921 GAGCCGGTGGGCCGGAGCGAGCCGCTGCCGCTGTCGTTCGCACAGCAACGCCTGTGGTTC 198 0 
EPVGRSEPLPLSFAQQRLWF 

1981 CTGGACCGCTTGATGCCCGACCGCGCCTTCTACACGATGTGCGACGCGTTCCGCGTCCGG 204 0 
LDRLMPDRAFYTMCDAFRVR 

2 041 GGCGGGATCGACCTGGGTGCGCTGCGGCGGGCCCTGCGGATGCTGGTGGGACGGCACGAG 210 0 
GGIDLGALRRALRMLVGRHE 

2101 ACGCTGCGGACGGCGTTCGTCGAGCGGGACGGTGTGCCGTACCAGCTCGTCGGTCCGGCC 216 0 
TLRTAFVERDGVPYQLVGPA 

2161 GACGGGCCCGGTGCGCGGCGCGTGGCCGCTCCCACGCGGGTCGACCTGTCGCTGCTGGAG 222 0 
DGPGARRVAAPTRVDLSLLE 

2221 CCCGCCGAGCGGGAGGAGGCGGTGCGGAACCTGGTGGCGGCGGAGGCGCGGACCCCGTTC 22 80 
P A E R^E^/E AVRNLVAAEARTPF 

22 81 CGG CCGG CGGACGG CG CG CTG CTG CGCGTGGTGGTGG C CCGG CTGG CGG ACGATGAT CAC 234 0 
RPADGALLRVVVARLADDDH 

2341 GTGCTGGTGGTCAGCACGCACCACATCGTCTCCGACGCCTGGTCCGTGGGTGTGCTGGTG 24 00 
VLVVSTHHIVSDAWSVGVLV 

24 01 GACGAACT CGG ACGG CTGT AC CGCG AGTGCGTC ACCGG AGAT C C CGCCGCGCTG C CC C CG 2460 
DELGRLYRECVTGDPAALPP 

24 61 CCGG CCGTCCAGTACGCCGACTTCGCGGTCTGGC AG CGGGC CTGG ATGGCCGGTCCGGTG 2 52 0 
PAVQYADFAVWQRAWMAGPV 

2521 CAGGAGGAGCATCTCGCGTACTGGAAGCGGGCCTTGGACGGCGCTCCCTCGGTGCTGCGG 2580 
QEEHLAYWKRALDGAPSVLR 

2581 CTGCCCATGGACCACCCGCGGCCCGCCGTGCAGTCCGAGCGGGGCGAGACGGTCGGGTTC 264 0 
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LPMDHPRPAVQSERGETVGF 

2641 G CGCTG CCCGACGCGCTGGTCG CCGCG CTGG AG AAG CTGGGCCGGG AG CAGGGCGCC ACC 2700 
ALPDALVAALEKLGREQGAT 

27 01 CTGTTCATGACGCTGCTCGGCGCCTTCCAGGTCCTGCTGGCGCGTCACGCCGGGCAAGAG 2 760 
LFMTLLGAFQVLLARHAGQE 

27 61 GACATCGTGGTCGGCGTGCCGGCGGCGGGGCGCACCCGGACCGAGACGGAACCTCTGGTC 2820 
DIVVGVPAAGRTRTETEPLV 

2821 GG CTT CTT CGTC AAC ACG CTTC CCTTG CGGG CG ATCTGCGCTC CGGG C CTGTCGTTCCGG 2880 
G FFVNTLPLRAI CAPGLS FR 

2 881 GACCTGCTGGACCAGGTGCGCGAGGCCGCCCTCGGCGCCTTCGCCCATCAGGACCTCCCC 2 94 0 
DLLDQVREAALGAFAHQDLP 

2 941 TTCGAGGCGCTGGTCGAGGCGCTCGCACCCGAGCGCGACCTCGGCCACAATCCCCTCGTC 3 00 0 
FEALVEALAPERDLGHNPLV 



3 0 01 CAGGTCACCTTCCAGCTCCTGGGCACACCGGCGGCGCGGCCGGACCTGATCGGGACGGAG 3 060 
QVTFQLLGTPAARPDLIGTE 

3 061 GTCGAGCGGTACCCGGTCCAGGAGGCCGTCTCGCAGTTCGACCTGTCCCTGGACATCAAG 3120 
VERYPVQEAVSQFDLSLDIK 

3121 CGGGCCGACGACGGTTCCTACCGGGGGATCCTGAACTACTGCCCCGACCTGTTCGACCGA 3180 
RADDGSYRGI LNYCPDLFDR 

3181 CGCCGCATGGAGGTGCTGGTCGGCCACTACCTGACGCTGCTCGGCGCCGCCGCCGCGGAC 3 24 0 
RRMEVLVGHYLTLLGAAAAD 

i*n ...... 

^ 3241 CCGGGCCGCCCGATCGGTGAGCTGCCGCTGTCCGACGGGGCCGAACGGCTGCGGCTGCTC 3 3 00 

PGRPIGELPLSDGAERLRLL 



33 01 GACGGGTTCGGGAAGCGGGACGCGGCGTACGCCGGGCCGGGAAGCGTTCCGGAGCGGTTC 33 6 0 

DGFGKRDAAYAGPGSVPERF 

3361 GCGGAGGTGGCGCGGACGGCACCGGACGCGCGGGCGGTGACGTGTGGCGCGACAACGCTC 34 2 0 
AEVARTAPDARAVTCGATTL 

34 21 ACCTTCGCCGAGCTGAACGACCGGGTGGAGCGCCTGGCACAGGCACTGCTCGGCGCCGGG 34 80 

TFAELNDRVERLAQALLGAG 

34 81 GTCACCCGCGAGACGCCGGTCGCGGTCCGCCTGCCCCGTTCCACCGACAGCGTCGTCGCC 3 54 0 
VTRETPVAVRLPRSTDSVVA 

3 541 CTGCTGGCCGTCATGCGGGCGGGCGGCGTCTACGTCCCCCTGGACCCCGACTGGCCCGCG 3 600 
LLAVMRAGGVYVPLDPDWPA 

3 601 GACCGCACCGCCTACATCCTGGACGACACCGCGGCCTCCGTCGTCATCACCCGCGACCTG 3660 
DRTAYI LDDTAASVVITRDL 

3661 CCCGCACTCCCCGGTCGCCTCCACGTCGACCCGCGCCGGCCCGCGGCCGACGGCCTGGTA 3 72 0 
PALPGRLHVDPRRPAADGLV 

3721 CCCGCGCCCCGCATCGACCCCGATCAGGCCGCGTACGTCATCTACACGTCCGGCTCGACG 3 780 
PAPRIDPDQAAYVIYTSGST 

3 7 81 GG CGCG CCG AAGGG CGT CGTCGTCCGG CAC CGCTC C CTG AACCACCTCAC CAGCG C CCTG 384 0 
GAPKGVVVRHRSLNHLTSAL 
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3 841 CAGGCCACCTTTCTCGGCCACGACCCGTATCTCGCCGGGGCCGACGGCGTACCGCCCGGG 3 900 
QATFLGHDPYLAGADGVPPG 

3 901 GACGCGAAGCTGCGTACGACGCTCACCGCGCCCTTCACGTTCGACGCGTCCATGGAGCAA 3 960 
DAKLRTTLTAP FTFDASMEQ 

3961 CTGAGCTGGATGCTGGCCGGTCACGAGCTGTTCATCGTGCCCGAGGACGTGCGGCGCGAC 4 02 0 
LSWMLAGHELF IVPEDVRRD 



4 021 CCCTCGGCGCTGGTCCGGTTCGTCCGGGAGCACCGGATCGACGTCATCGACACGACCTCC 4 080 
PSALVRFVREHRI DVI DTTS 



4 081 TCGCAGCTCGAACTCCTCGTATCGCACGGGCTGTTGGACGGAGAGTGGGCGCCGTCCATG 414 0 
SQLELLVSHGLLDGEWAPSM 

4141 GTCATGGTGGGTGGCGAGGCGGTCTCGCCGTCGCTGTGGCGGACCTTGCGGGACCAGCGG 4200 
VMVGGEAVS PS LWRTLRDQR 

4 2 01 CGCACTCGCTGTTTCAACCTGTACGGGCCTACGGAGGCGACGGTCGACGCCACCTGCCAC 4260 
RTRCFNLYGPTEATVDATCH 



GACCTGTCCGACCCCGCCGACGTCCCCGTCATCGGCACCCCACTCCCCCACACCCACGTC 432 0 
DLSDPADVPVIGTPLPHTHV 



CGCGTGCTCGACGACCGACTGCGACCCGTACCCGTGGGCGTCGCCGGCGAGATCTACCTC 43 80 
RVLDDRLRPVPVGVAGEIYL 



GGCGGAACCGGCCTGGCCCGCGGCTACCTCAACCGCCCCGCCCTCACCGCCCGACGCTTC 44 4 0 
GGTGLARGYLNRPALTARRF 



GTCGCCGACCCCTACCCCGACACCCCCGGCAGCCGCCTGTACCGCACCGGCGACCGCGCC 4 5 00 
VADPYPDTPGSRLYRTGDRA 

CGCTGGCGCCCCGACGGCACCCTCGAATACCTGGGACGCACCGACGACCAAATCAAGATC 4 56 0 
RWRPDGTLEYLGRTDDQIKI 

CGCGGCTTCCGCGTCGAACCCGGCGAAATCGAGGCCGTCCTCACCCACCACCCCGCCGTC 4 62 0 
RGFRVEPGEI EAVLTHHPAV 

4 621 AAGGAAGCCGCCGTCGTCGACGACGCGCACGCGCGGCTGGTCGCCTACGTCACGCTCGCG 4 68 0 
KEAAVVDDAHARLVAYVTLA 



4 681 GAAGGCGGCGGCGCCGGCCCCACCGACGTACGCCGGTTCGCGCAGGGGCGGCTGCCCGCC 4 74 0 
EGGGAGPTDVRRFAQGRLPA 

4741 CACATGGTGCCGTCGGCGGTGGTCGTCCTGGAGGCGCTGCCACTGACGTCGAACGGAAAG 4800 
HMVPSAVVVLEALPLTSNGK 



4 801 CTGGACCGCGCGCGCCTGCCGGCGCCCGCGGCGGGCAGACCGGAACTGGATGTCCGCTTC 4 860 
LDRARLPAPAAGRPELDVRF 



4 861 GTGGCGCCGCGCGACATGGTGGAGGAGGTCGTGGCGCAGGTCTGGTGCGCCGTGCTGGGC 4 920 
VAPRDMVEEVVAQVWCAVLG 

4 921 GTCG AC CGGGTCGGTGTG CACG ACG ACTTCTTCG AGCTGGGCGGG C ACTCGTTG CTGGTG 4 980 
VDRVGVHDDFFELGGHSLLV 

4 981 GTC CAGGTG ATG ACC CGG ATACG AAAGCTGCTCGG CGT CG AGGTG CCGTTGCGGGAGCTG 504 0 
VQVMTRIRKLLGVEVPLREL 

5041 TTCGACGCCGCGACGGTCGAGGAGCTCGCCGCCCGCGTCCGCGCCGCACGGACCGAGGGC 5100 



4261 



4381 



4501 



( 8 *l 

;cr 4561 
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FDAATVEELAARVRAARTEG 



5101 CTCGG CCGGGGGG CCGCC C CGCC CCTCGGGCCGGTGGACCGGAG CGGGCCG CTG C CG CTG 516 0 
LGRGAAPPLGPVDRSGPLPL 



5161 TCGTTCGCGCAGCAACGCCTTTGGTACCTCGATCAGTTGGCGCCCGACAGTGTCTCCTAC 522 0 
SFAQQRLWYLDQLAPDSVSY 

5221 AACATGTGCGACGCCTACCGGGTCCGCGGCCCTCTCGACCTGGACGCGCTGCGGCGGGCG 528 0 
NMCDAYRVRGPLDLDALRRA 

52 81 CTGCGGACGCTGGTCGAGCGGCACGAGACGCTGCGGACGGCGTTCGTCGAGCGGGACGGG 534 0 
LRTLVERHETLRTAFVERDG 



5341 GTGCCCCACCAGGTGGTCTCGGCGCCCGACGCGCCGGCCGCGCGGCGCGCGGCGGAGGTC 54 00 
VPHQVVSAPDAPAARRAAEV 

54 01 GTGCGGATCGAGGCGGCCGGGCGGACCGACGAGGCGGTGCGGGACCTGGTGGCCGCGGAG 54 60 
VRI EAAGRTDEAVRDLVAAE 

54 61 GCGCGCACCCCGTTCCGGCCGGCGGACGGCGCGCTGATGCGCGTGGCGGTGGCCCGGCTG 552 0 
ARTPFRPADGALMRVAVARL 

5521 GCGGACGACGATCACGTGCTGGTGGTCACCACGCACCACATCGTCTCCGACGGCTGGTCG 5 58 0 
ADDDHVLVVTTHH IVSDGWS 

5581 GTCGACATCCTGGTGGACGAATTGGGGCGCCTGTACCGGGAACACGTCACGGGTGACCCC 564 0 
VDILVDELGRLYREHVTGDP 

5641 GCCGGGCTCCCTCCGCTCGACGTCCAGTACGCCGACTTCGCCGTCTGGCAGCGGTCCTGG 57 00 
AGLPPLDVQYADFAVWQRSW 

5701 ATGACCGGCCCCGTGCGGGAGGAGCACCTCGCGTACTGGAAGCGGGCCCTGGACGGGGCA 576 0 
MTGPVREEHLAYWKRALDGA 

5761 CCCTCGGTCCTGCGGCTGCCGGCGGACCATCCGCGTCCCGCCGTCCAGTCCCAGCGGGGC 582 0 
PSVLRLPADHPRPAVQSQRG 

5821 GAGACCGTCGAGTTCCCCCTGCCCGCACCACTGGTCGCGCGGCTGGAAGCGCTCTGCCGG 58 80 
ETVEFPLPAPLVARLEALCR 

5881 GAG CAGGGCGT C AC CCTGTTCATGG CG CTCTTCGGCG CGTT C CAGGTGTTG CTGG CGCG C 5 94 0 
EQGVTLFMAL FGAFQVLLAR 

5 94 1 T ACAG CGGT CAGGACG ACGTGGT CGTGGG CGTG C CGACGG CG AAC CGCACC CG CG CGGAG 6000 
YSGQDDVVVGVPTANRTRAE 

6001 ACCGAGCCCCTGGTCGGCTTCTTCGTCAACACCCTTCCGGTACGGGTCGCGTGCTCGCCG 6 060 
TEPLVGFFVNTLPVRVACS P 

6061 GAGCTGTCGTTCCGCGCCCTGCTCGACCGGGTCCGCGAGGCCGCGCTGGGCGCCTTCGCC 612 0 
ELSFRALLDRVREAALGAFA 

6121 CATCAGGACCTGCCCTTCGAGGCGCTGGTCGAGGCGCTCGCGCCCGAGCGCGACCTGGGC 6180 
HQDLPFEALVEALAPERDLG 

6181 CACCACCCTCTCGTGCAGGTCACCTTCCAACTCCTCGACGCTCCCGACGAGAGGCTCGTC 624 0 
HHPLVQVTFQLLDAPDERLV 

6241 CTGCACGGCACGGACTGCGTCTCGCTCGGCTTCGGCGGTGTGACCAGCCGGTTCGACCTG 63 0 0 
LHGTDCVSLGFGGVTSRFDL 
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63 01 TCCCTCGACGTCGTCTCGGGGCGGCGGGGGAAGCGGTGCGTGCTGACGTACTGTCCCGAC 63 6 0 
SLDVVSGRRGKRCVLTYCPD 

6361 CTGTTCGACCGGCCCCGCATGGAGGTGCTGGCCGGCCACTACCTGACCCTGCTCGGCGCG 6420 
LFDRPRMEVLAGHYLTLLGA 

6421 GCGGCCGACGATCCCGGTCTCCGCGTCGGCGACCTCCCGCTGAGCGACGACGTCGAACGC 64 80 
AADDPGLRVGDLPLSDDVER 

6481 CTGCGCCTGCTGGGCGGGTCCCGCCCGCGGTACCTGCCCGCGCCCGGGGCGGAGACCGTC 654 0 
LRLLGGSRPRYLPAPGAETV 

6541 CCTGACGCCTTCGCCGCGCAGGTGCGGGCGACACCGGACGCGCCCGCGCTGGTCCACGGG 660 0 
PDAFAAQVRATPDAPALVHG 

6601 GACTCGACGCTGACGTTCGCCGAGCTGGACACCCGGGTCACCGCCCTGGCCGTGCGGTTG 6660 
DSTLTFAELDTRVTALAVRL 

6661 CGGCGCTGCGGCGTGGCCGCCGAGACGCCGGTCGCGGTGTGCCTGCCGCGCTCCGCCGAC 672 0 
RRCGVAAETPVAVCLPRSAD 

6721 GCCGTCGTGGCCCTCCTGGCCGTCCTGCGGGCGGGCGGCGTCTATGTGCCAGTGGATCCG 678 0 
Q AVVALLAVLRAGGVYVPVDP 

is £ 6781 GAGTGGCCCTCCGGCCGCGTCGCCCACGTCCTCGACGAGACCGCGGCCCCCGTCGTCATC 684 0 

Sj EWPSGRVAHVLDETAAPVVI 

i (f% 6841 ACCCGCGACCTGCCCGCCGATCCCGGCCGCGTCCACCTCGACCCGCGCCAGGCCCCGGCC 6 900 

,™ TRDLPADPGRVHLDPRQAPA 



6 901 GACGACCGGGATCCCCTGCCGCGCCTCCACCGCGACCAGGCCGCGTACATCATCTTCACC 6 96 0 
DDRDPLPRLHRDQAAYI IFT 

6961 TCGGGCTCCACCGGCGCCCCCAAGGGCGTCGTCGTCCGACACGGCTCCCTGTACCACCTC 7 02 0 
SGSTGAPKGVVVRHGSLYHL 

7021 CTGGGCCACGTACGGCGCATGGCGGAGGGCGGCCCCCGGCGGAACGTCGCGCACACCACC 7 080 
LGHVRRMAEGGPRRNVAHTT 

7081 GCGATGACCTTCGACCCGTCGCTGGAACAGTTCCTGTGGCTCGTCGCCGGACACACCCTG 714 0 
AMTFDPSLEQFLWLVAGHTL 

7141 CACGTCGCGCCCGAGGAGGTGCGCCGCGATCCCGAGGCGCTGGTGGCCCTGGTGCGGCGC 72 00 
HVAPEEVRRDPEALVALVRR 

7201 GCCGCGATCGACGTCCTCAACGTCACCCCGTCCCACCTGACCCTGCTGATCGAGGCCGGG 72 60 
AAIDVLNVTPSHLTLLIEAG 

7261 CTG CTGGAGGG CG ACCGGGTGC CGGGTACGGT CCTGGTGGGTGG CG AGG CGGTGC C CG CG 7320 
LLEGDRVPGTVLVGGEAVPA 

73 21 GCGCTGTGGCGGACCCTGCGCGAACGGACGGGAGCCACCCGCTTCTTCAACCTGTACGGG 73 80 
ALWRTLRERTGATRFFNLYG 

73 81 CCTACGGAGGCGACGGTCGACGCCACCTGCCACGACCTGTCCGACCCCGCCGACGTCCCC 744 0 
PTEATVDATCHDLSDPADVP 

7441 GTCATCGGCACCCCACTCCCCCACACCCACGTCCGCGTGCTCGACGACCGACTGCGACCC 7500 
VIGTPLPHTHVRVLDDRLRP 

7501 GT AC C CGTGGG CGTCGCCGGCG AAATCT AC CTCGG CGGAACCGG CCTGGCCCG CGGCTAC 7 560 
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VPVGVAGE IYLGGTGLARGY 



7 561 CTCAACCGCCCCGCCCTCACCGCCCAACGCTTCGTCGCCGACCCCTACCCCGACACCCCC 762 0 
LNRPALTAQRFVADPYPDTP 

7621 GGCAGCCGCCTGTACCGCACCGGCGACCGCGCCCGCTGGCGCCCCGACGGCACCCTCGAA 768 0 
GSRLYRTGDRARWRPDGTLE 

7681 TACCTGGGACGCACCGACGACCAAATCAAGATCCGCGGCTTCCGCGTCGAACCCGGCGAG 774 0 
YLGRTDDQIKIRGFRVEPGE 

7741 ATCGAAGCCGTCCTCACCCACCACCCCGCCGTCAAGGAAGCCGCCGTCACCGTGGCCACC 7 8 00 
I EAVLTHHPA VKEAAVTVAT 

7801 GACGACGGTGCCGCCCGGCTGGTCGCCCTCGTCGTCCCCGCCCCCCGCGCCCCGCACGGC 7860 
DDGAARLVALVVPAPRAPHG 

7861 GATTCGGCCGACGGCGCCCCGGACGCCCAGGTCGAGGAGTGGAACGCCGTCTTCGAGGCG 7920 
DSADGAPDAQVEEWNAVFEA 

7921 ACCCACACCGACGCCGCCGACGGCGAACTCACCTTCAACATCAAGGGCTGGAACGACAGC 7980 
THTDAADGELTFNI KGWNDS 

7981 CTCACCGGTGCGCCGATCCCCGCCGAACACATGCGGGAATGGGTCGACACCACCGTCGCC 804 0 
LTGAPI PAEHMREWVDTTVA 

8041 CGGCTCCTGGAACGGCCGGCCGAGCGCGTCCTGGAGATCGGCAGTGGCACCGGGCTGCTG 810 0 
RLLERPAERVLEIGSGTGLL 

8101 ATGTGGCGGCTGCTGCCGCACGTCACCGAGTACACCGGAACCGACTTCTCGCGGCCCGCC 816 0 
MWRLLPHVTEYTGTDFSRPA 

8161 GTGGACTGGCTCCGGGACGGGCTGCGCCGCCGCCCCGCGCACCGGGTACGGCTGCTGCAC 822 0 
VDWLRDGLRRRPAHRVRLLH 

8221 CGCGAGGCGACCGACTTCACCGGCGTCCGCGCCGCGTCCACCGACCTCGTCGTCGTCAAC 82 80 
REATD FTGVRAASTDL VVVN 

8281 TCGGTCGTCCAGTACTTCCCCGACCGCGCCTACCTCGACACCGTCCTGGCCCGCGCCCTC 834 0 
SVVQYFPDRAYLDTVLARAL 

8341 GACGCCACGGCCGACCGAGGGCGCGTCTTCGTGGGCGACGTGCGCAACCTGGCCCTCGCC 84 00 
DATADRGRVFVGDVRNLALA 

8401 CCGCAGTTCTACGCCCGTCAGGCCCTCGCCCACGCCGGTCCGGGCGCGGCGGCGCGGGAC 84 60 
PQFYARQALAHAG PGAAARD 

8461 GTGGCGCGCGCCGCCGGCGAGTTCGCGGCCATGGACGGCGAGTTGCTGGTGTCCCCCGCG 8 52 0 
VARAAGEFAAMDGELLVS PA 

8521 TACTTCGCCGCGCTCGCCGCCCGCTCGCCCCGCGTCACCGGCGTCGAGATCCTGCCCCGC 8 58 0 
YFAALAARS PRVTGVEILPR 

8581 CGGGGACGGCAC CGCAACGAG ATG AG C CTGT AC CG CT ACGACGTGGTGCTG CACGTGGGC 864 0 
RGRHRNEMSLYRYDVVLHVG 

864 1 GGTGACCGCCCGGCGGCCCCGGAGGCGGAGGTGCTCACCTGGGGCGACCAGGTGCACGAC 8700 
GDRPAAPEAEVLTWGDQVHD 

8701 CTCGCGTCGCTGTCCGCCCGCCTCGGCCGCGGGGGCCCGGACGCCCTGCTCGTGCGCGGC 876 0 
LASLSARLGRGGPDALLVRG 



47 



I s ! I 



8761 GTCGCCAACGACCGTCTGACGCGGGACAACGAGCTGCTCGACGCACCCGCCCGCACGACG 882 0 
VANDRLTRDNELLDAPARTT 

8 821 GCCGTCGAGCCCGAGGACCTGTGGGGGCTGGCGGACTCCACCCCCTACCGGGTGAGCGTC 888 0 
AVE PEDLWGLADSTPYRVSV 

8881 AGCTGGGCCG CCGCCGATCCGCGGGG CG CG ATGG ACGTCCTGCTGGTC CGG CGGG ACG C C 8 94 0 
SWAAADPRGAMDVLLVRRDA 

8 941 CACGACGACGGTCCGCTGCTCGTCCCCCACCCCGTACCGGAGCCCTCGGCACCGCTGACG 9000 
HDDGPLLVPHPVPEPSAPLT 

9001 AACACGCCGACCCGGCACCCGTCCGCGCGGCAAGGGGGCTCGGCCGCGGACGGGCTGCGT 906 0 
NTPTRHPSARQGGSAADGLR 

9061 TCCTGGCTCGCCGAGCGGCTTCCCGCGCACCTGCTGCCCGCGAGGATCACCGAGGTGGAC 912 0 
SWLAERLPAHLLPARI TEVD 

9121 GCGCTGCCCCGCACCGGCACCGGCAAGCTCGACCGGGGCGCGCTCGGCGGACTCGTGACC 918 0 
ALPRTGTGKLDRGALGGLVT 

9181 GCGGGCCGTGGCGCCCGGGCGGGCGACCGCCCCGCCACCGCCCCCCGTACGGGTCTCGAA 924 0 
AGRGARAGDRPATAPRTGLE 

92 41 CGGACCCTGGCCGACGCGTGGGCGCGGGTGCTCGGCCTCCCCGAAGTCGGCGTGCACGAG 93 00 

RTLADAWARVLGLPEVGVHE 

93 01 AACTTCTTCGCCCTCGGCGGCGACTCCCTCCTCGCCGTCAGGGCTGTCGCCCGGTGCCGC 93 6 0 

NFFALGGDSLLAVRAVARC R 

93 61 CGTGCCGGGGTCCGACTGACCGTCCGGCAGTTGCTGAGCGAGCAGACCGTCGCCGCGCTC 94 2 0 

RAGVRLTVRQLLSEQTVAAL 

94 21 GCGGCGGCCCTCGAGGAGGAGTCTCAATGATGAAGTCAAGCCGCTTGCGCGACCGGCAGC 94 8 0 

AAALEEESQ* 

MMKS SRLRDRQL 
(orf 33) 

94 81 TCGGGGGTG AAG AC C CGGTTGTCG CGCAGG AGAG C C CAC AGGACG CTGGCC CG ACGC CGT 954 0 
GGEDPVVAQES PQDAGPTPC 

9541 GCCAGGGCGATGACGGCTTGAACGTGTTTGCAGCCCTCGCCGCGCTTCTTGAGGTAGAAG 96 00 
QGDDGLNVFAALAALLEVEV 

9601 TCCCGGTTCGGCCCCTCCCGCATCATGCTGGTTTGGGCCGACATGTAGAACACTCGTCGC 96 60 
PVRPLPHHAGLGRHVEHSSQ 

9661 AGG CGG CGG CTGT AG CG CTTGGG C CG ATG CAGGTTG C C AGTG CG ACG AC CGGAGTCGCGG 9720 
A AAVALGPMQVASATTGVAG 

97 21 GGGACGGGCACCAGGCCGGCCGCCGAGGCCAGGTGACCGGCGTCGGCGTAGGCCGTGAGG 978 0 
DGHQAGRRGQVTGVGVGREV 

9781 TCGCCGGCGGCGACGACGAACTCGGCGCCGAGGATCGGCCCCATGCCCGGCAGAGACTCG 984 0 
AGGDDELGAEDRPHARQRLD 

9841 ATGATCTCGGCCTGTGGATGGCTGCGGAACGTCTCGCGGATCTGCTGGTCAATCCGCTTC 9900 
DLGLWMAAERLADLLVNPLQ 

9901 AGACGGTCGTCCAGGGCCAGGATCTGCGCGGCCAGGTCAGCCACGATCTGGGCGGCGACG 9 960 
TVVQGQDLRGQVSHDLGGDV 
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9961 TCCTCCCCGGGCAGCGCGGTCTGCTGAGCCTGGGCAGCCTCCAGCGCCGTCGCGGCGACG 1002 0 
LPGQRGLLSLGSLQRRRGDG 

10021 GCGTCGGCACCGCGCACGCCTCGGTTGGCCAGCCAGGCCGTCAGCCGGGCCCGGCCGCGG 10080 
VGTAHASVGQPGRQPGPAAA 

10081 CGG CGG AG AG CTG C CGGGGTCTGGT AG C CCGTGAGC AGG ACCAG CGCG C CCTTCTGCGAG 1014 0 
AESCRGLVARQQDQRALLRA 

10141 CTGT AGT CG AAGG C C CGTTCC AGCGCGGGGAAG ACG C CGGT C AGCGTGTCG CGG AGACGG 10200 
VVEGPFQRGEDAGQRVAETV 

10201 TTGATCATCCTGACCCGGTCGGCCACGAGGTCGGAACGGTGGGCGGTCAGCAGCGCGAGG 1026 0 
DHPDPVGHEVGTVGGQQREV 

10261 TCGGCGGCCAGCTGGGCGGGCACGTCGATCGACGCGAAGTCCCGTCGGTTGCGGGCGGTT 103 2 0 
GGQLGGHVDRREVPSVAGGF 

103 21 TCGGCGATGACGTAGGCGTCGCGGGCGTCGGTCTTCGCCTCGCCCCGGTAAGCGCCGGAC 103 8 0 
GDDVGVAGVGLRLAPVSAGH 

10 3 81 ATGCGGTTGACCGTGCGGCCGGGCACGTAGACGGCCTGCTGGCCGTGGGCCGCGAGCAGG 1044 0 
AVDRAAGHVDGLLAVGREQG 

10441 GCCAGCAGCAGCGCGGAGGACGTGCCGGAGATGTCCACTGCCCAGTGGACCTCGTCGGCC 10500 
QQQRGGRAGDVHCPVDLVGQ 

10501 AGGTCGAGGATCTCACCCATGGCGGTCAGGATCGCCGACTCATCGTTGCCGATCTTCTTC 10560 
VEDLTHGGQDRRLIVADLLR 

10561 GACCACAGCGTCACACCGGTCTCGTCGACCACCGCCGCCCAGTGATGCCCCTTGCCCGCG 10620 
PQRHTGLVDHRRPVMPLARV 

10621 TCGATCCCGGCCCAGACCCGGGCCCGTCGCTCGCCCACTCGCCCCTCCTCACTCCGAACA 1068 0 
DPGPDPGPSLAHSPLLTPNS 

10681 GCATCCCGTCGACCCGAGGAACACCCCGCTGTCATCTCCGTAAAAAGCGACCGAAGCGCA 1074 0 
I PSTRGTPRCHLRKKRPKRT 

10741 CATCTCAATCAGCAGCCAGGGCGCCCCGGAGAACCGGGCGGCCACTCCTTGTAAGCCACT 108 00 
SQSAARAPRRTGRPLLVSH* 

10801 GACGGCAGAGAACCATAAGCCACACCCGGCCCTCCCGGGCCGCCTAACAACTTACGGAGA 10860 

10 861 ACCATGACTGACCTGCCGTTGCGTACCGTCGCACTCACCGGTGAGGAGAGCGCGGAGGTC 1092 0 
MTDLPLRTVALTGEESAEV 
(orf34) 

10 921 GACGACCTGCTGCGCACGCTGGCCGACGTGCCGGTCGACTCCACCGTGGGACTGCTGCAC 10980 
DDLLRTLADVPVDSTVGLLH 

10981 CGCACC CGGCTCG C CG CACAGGAACTGCCGCTG CG C ATCCG CGC CGAGCTC ACGGGG ATG 1104 0 
RTRLAAQELPLRI RAELTGM 

11041 CGGCTCTACGACAGCCCGCGCGCCCTCGTCGTCACGGGCTTCGGCGTCGACGACGAACGG 11100 
RLYDS PRALVVTG FGVDDER 

11101 ATCGGACCGACCCCCGCGGCCCGTCCCGCCCCGGATCCCGAGCGGACCCGCGACCTGGAG 11160 
IGPTPAARPAPDPERTRDLE 

11161 CTGCTGCTTTTG CTG CACG CGGCCCTGCTCGGCG AGG CGTTCGG CTGGGCGAC CCAG CAG 11220 
LLLLLHAALLGEAFGWATQQ 
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11221 AACGGCCGGCTCGTCCACGACGTGCTGCCCGTTCCCGGTGAGGAGACCGCGCAGATGGGT 11280 
NGRLVHDVLPVPGEETAQMG 

112 81 TCCAGCAGCGAGACCGAGCTGCTGTGGCACACCGAGGACGCGTTCCACCCGCTGCGCTGC 1134 0 
SSSETELLWHTEDAFHPLRC 



11341 GACTACGTGGGCCTGCTGTGCCTGCGCAACCACCAGCGCGCCGCGACCACCGTGGGCTGG 114 00 
DYVGLLCLRNHQRAATTVGW 



114 01 CCCGACCTGTCCCGGCTCACCACCGAGGACCGTGCCGTGCTCCTCGAACCCCGCTATCTG 114 60 
PDLSRLTTEDRAVLLE PR YL 



11461 ATCCGCCCGGACACCTCGCACACGCCCGCGCAGAACGCGACGGGCACGCGGTCCGCCGAG 1152 0 
IRPDTSHTPAQNATGTRSAE 

11521 CGTTTCGCGGCGATCGCCGAGATGGACGACGCCCCGGAGCGCGTCGCCGTCCTGTTCGGC 11580 
RFAAIAEMDDAPERVAVLFG 



11581 GACCCCGAGGACCCGTACCTGCGGATCGACCCGGCCTACATGAGCCCGGCCCCCGGGGAC 1164 0 
DPEDPYLRIDPAYMSPAPGD 



11641 GCGGCCGCCCGGCGGGCGTACGACACCGTCACCGCGCTCATCGAGGACGAGCTGCGGCAC 11700 
AAARRAYDTVTAL IEDELRH 



11701 GTCGTCCTGGACGCCGGTTCACTGCTGCTGGTCGACAACTACCAGGCGGTGCACGGCCGC 11760 
VVLDAGSLLLVDNYQAVHGR 

11761 AAGCCGTTCGCCGCCGCCTACGACGGCCGCGACCGCTGGCTCAAACGCGTCAACATCACC 1182 0 
KPFAAAYDGRDRWLKRVNIT 



11821 CGCGACCTGCGCCGTTCCCGGTCCGCGCGGCGGTCGGCCACCTCGCTGCTGGTGTGAGGG 1188 0 
RDLRRSRSARRSATSLLV* 



11881 AGGCACCATGGATTTCCCCCTCACCCGCGTCAACCCCTGGTTCAGCGGCGGCTGCGACGG 1194 0 
MDFPLTRVNPWFSGGCDG 
(orf 35) 

11941 CCGCCCCCGGGTGCGGCTGTGCGCGCTGCCGTACGCGGGCGGCACCGCCGCCGTCTTCAA 12 000 
RPRVRLCALPYAG GTAAVFK 

12001 GGACTGGCCCGCCGCGCTGCCCCCCGGAGTGGAGCTGCTCACCGCGCACCTGCCGGGACG 12 060 
DWPAALPPGVELLTAHLPGR 



12061 CGGCGACCGGTTCACCGAACCGCCCCCGGCCACCCTGGAGGAGACCGCCGAGCGGCTGTG 1212 0 
GDRFTE PPPATLEETAERLC 



12121 CGAGGCGCTGCCGCCGAGTGACCTGCCCACGGTCGTCCTCGGCCACAGCATGGGCGCCCT 12180 
EALPPSDLPTVVLGHSMGAL 



12181 G CTGGGGTACG AAGTGGCGG CG CGG CTCGCGGCGCGGGGCCG CG CCC C CAACCTG CTG AT 12 24 0 
LGYEVAARLAARGRAPNLLI 



12241 CGCCGCGGCCTGCCGTCCCCCGCACGTTCCGCCGGACGCCTCCGGTCCGGTGACCGAGGC 123 00 
AAACRP PHVPPDASGPVTEA 



123 01 CG AG CTGG CGG C C AC CCTGCGGG CCGAACGCCC ATGGG ACACGGCC CTGAGGG ACG AGG A 12 3 60 
ELAATLRAERPWDTALRDEE 



12361 ACTGATGGAAGCGGTGCTGCCCGCCCTGGTCGCCGACATCACGGCCGGCGACCGCTACCA 124 20 
LMEAVLPALVADITAGDRYH 



12421 CCGCCCGCGGCCCCGCCCGCTCGACCTCCCGCTGAAGGTCTACATCGGCGCCGACGACGA 124 80 
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RPRPRPLDLPLKVYIGADDD 



124 81 CGGCACCGACTGGCGCACCACCCTGGGCTGGCGCGCGTGCACCGCCCGGGACTGCGAGGT 1254 0 
GTDWRTTLGWRACTARDCEV 

12541 CGTCGTCCTGCCCGGCGGCCACTACTTCCTGGAGACCGACCGCGCGGCCGTCCTCACCCG 126 00 
VVLPGGHYFLETDRAAVLTR 



12601 CGTCGCC ACGG ACCTCG C CG AAG CCG AGGTAGGGG CATG AC CGCGCGCGTCG ACGC CACA 12 660 
VATDLAEAEVGA* 

MTARVDAT 
(orf 36) 

12 661 CCCACCTACCTGGCGGTGCTGGCGGTGCGCGAGGCCCGCGCCCCGCTCCTCGGCAGCTGC 12720 
PTYLAVLAVREARAPLLGSC 

12721 CTGGCCCGCATGTCCTTCGCGGTGCTGCCGCTCGCCCTGCTGCTGTCGGTCCGGGACGCG 12780 
LARMS FAVLPLALLLSVRDA 



12781 ACGGGGTCGTTCGCCGTCGCCGGACTGACCTCCGGCGCGCTGTCGGCCACGCTCACGCTG 1284 0 
TGSFAVAGLTSGALSATLTL 



TTCGCGCCCGCCCGCGCCCGGCTGATCGACCGCCGGGGCTCACGGTCCGGACTGGTCCGG 12 900 
FAPARARLIDRRGSRSGLVR 

CTGACCGTCCCGTACCTGCTGGGGCTCGCCGTGCTGATCACATTGGCCGAGGCGGAAGCG 12 96 0 
LTVPYLLGLAVLITLAEAEA 



CC C AC CGCGGCG CTG CT CGTCGC CG CCG CGGT CGCGGG CGTGTT CGCGC CG C CG CTCGGT 13 020 
PTAALLVAAAVAGVFAPPLG 

CCGACCATGCGCGTGCTGTGGGCGAGGATCCTGCACGGCCGTCAGCCCCTGCTGCACACC 13 080 
PTMRVLWARILHGRQPL LHT 

GCCTACGCCCTCGACTCCGTCACCGAGGAGGTGGTCTTCACCGTGGGGCCGCTGCTGGCG 1314 0 
AYALDSVTEEVVFTVGPLLA 



GGCGGCCTGATCGCGGTCGCGGCACCGCTCGCGTCGATGATCACGGTCATGGTGCTGATC 132 0 0 
GGLIAVAAPLASMITVMVLI 

GCGGCCGGTACCGCCTGCTTCGTGCTGTCCGCCGCGACCGCCGCCGCCCCCGCGTCGGGC 1326 0 
AAGTACFVLSAATAAAPASG 



13261 GAAGCCGACGAGGACCGGCCGCACGGCCGGCCCATGGCTCTGCCCGGGATGCGCACGATC 1332 0 
EADEDRPHGRPMALPGMRTI 

13 321 GTGCTGTCCTTCGGCGGCGTCGGCCTGGTCGTCGGGGTGCTCCAGGTCGTCCTGCCGTTC 133 8 0 
VLSFGGVGLVVGVLQVVLPF 

13 3 81 ATCGCCGACCACGCGGGCTCGCCCGGCGCGGGCGGCATCCTGCTGTCCATGCTGTCGGCG 1344 0 
IADHAGS PGAGGILLSMLSA 

13441 GGCAG CG CGGTCGG CGG C CTCGC CTACGGG CGGATCG C CTGG CG CTCGACG CCCGTG CGG 13 500 
GSAVGGLAYGRIAWRSTPVR 

13 501 CGGTTCGTGGTGCTCGTCACCGGGTTCACGCTGGCGGTGCTGCCGCTGTGCCTGACCGCG 13560 
RFVVLVTGFTLAVLPLCLTA 

13 561 AGCCCGGTGCCGGCCGGGGCCTTCGCCCTCCTCGTGGGACTCTGCCTCGCCCCGCTGTTC 1362 0 
SPVPAGAFALLVGLCLAPLF 

13 621 ACCACCG CCT ACCTG CTGGTCAACGACCTGGTGACGG CGTCGGGGAC CG CACCCAC CG AG 13680 
TTAYLLVNDLVTASGTAPTE 



12841 



12901 



12961 



13021 



13081 



13141 



13201 
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13681 GC CAACAC CTGGGTCTC CACGGCCAATAACGG AGGGTTCG CCG CGGGCAGCGCCG C CGCC 13740 
ANTWVSTANNGGFAAGSAAA 



13741 GGTGTGCTGCTCGACTCCCGGGGCCCCACCGTCACCGTCACCGCCGCGTTCGCGGTCGCC 13 8 00 
GVLLDSRGPTVTVTAAFAVA 

13 801 G CCGCGACCGC CGT CATG ACCGTTCTG CGCCG CCGG ACCCTG CTC CTCGGCG CCGGACAC 13 860 
AATAVMTVLRRRTLLLGAGH 

13861 CCCGAACCGGCCGCCGCCACACCCGCCGACCGCACCGCACCCGCCGAAGCCGAGGAGTGA 13 92 0 
PEPAAATPADRTAPAEAEE* 

13 921 ACCGATCGTGTCCAAGAACGCGGCGCACTGGTCGCGCATCCGCACAGGGGACGCCCCCGG 13 980 
MS KNAAHWSRI RTGDAPG 
(orf 37) 

13 981 CGTCGTACTCGCCGTGGACTTCTACGGAACGGGCCGCCAGGAAGCCACCTTCCGCCACCT 14 04 0 
VVLAVDFYGTGRQEATFRHL 

14 041 GTGCGACCTGCTCACGGATCCGGTCGAGGTCTGGCACGCGGTCCCGCCCGCCCCGGACGG 14100 
CDLLTDPVEVWHAVPPAPDG 

14101 CGACTGGTCCACGGCCACCGGCGCCGGTCACCTGCGCTGGTGGACCGAGGGGCTCGACAC 14160 
:Jrj DWSTATGAGHLRWWTEGLDT 

14161 GGTCCTCGCGGGACGGCCGGTGCGGGCCCTCGTCGGCTACTGCGCGGGCGGCGTCTTCGC 14 22 0 
■' h ~l VLAGRPVRALVGYCAGGVFA 

...... 

j'fi 14221 CTCGGCCCTCGCCGACGCCCTCGTCGAACGGGAGGGCCACCGGCCGCGGGTCGTGCTGTT 14 2 80 
'.iV. SALADALVEREGHRPRVVLF 

i U 

||L 14 2 81 CAACCCCAGCGCGCCCGGCGTCGCCACGCTCACCCGCGACTTCCGCGGTCTGATCGCCGG 1434 0 
NPSAPGVATLTRDFRGLIAG 

l "t 

KJ 14 341 CATGGACCTCCTCACGGACGGGGAACGCGCCGCTCTGCTGGCCGAGACGACCGCGATCCG 144 00 
IP MDLLTDGERAALLAETTAIR 



144 01 GCGGGCACACGCCCCCGACGCGCTGGTACCGGTCGCCGAACGCTACGCCGCCCTGTACCG 144 6 0 
RAHAPDALVPVAERYAALYR 

144 61 CG AGGG CTG CG AC CT C CTGTG CGAG CGG CTCGGCGTGGACG C CT C CTTCGG CG CCG AACT 14 52 0 
EGCDLLCERLGVDASFGAEL 



14 521 GGCCGCCGTCCTCCACTCCTACCTGGCCTACCTCACGGCGGCGCTCGACGTGCCCCCCAC 14580 
AAVLHSYLAYLTAALDVPPT 



14 581 CCCGCTGTGGCGCGGCGCCGTCTCGCTCACCTCCCGCGAGCACCAGGGCACCGACTTCAC 1464 0 
PLWRGAVSLTSREHQGTDFT 

14 641 CGACGTCGAGCACGGCTTCGACGTCGCCCGTGCCGAACTGCTGAGCTCCCCCCAGGTCGT 14 700 
DVEHGFDVARAELLSSPQVV 

14 701 CGCGGCGCTGACCGCGCTCCTCCGCGAACACGAGGCGAGCCGATGACCCTCACCCTGCGG 14 7 60 
AALTALLREHEASR* 

M T L T L R 
(orf38) 

14 761 GACGCCTTCCTCGACCAGGCCGCCCGGACCCCCGACGCCCACGCCGTCGTACACGGCGAC 14 82 0 
DAFLDQAARTPDAHAVVHGD 

14821 ACTGTATGGACGTACCGCGAACTGGAACTGCGGGCCGGCCGCATGGCCCGGACGCTGGCC 14 880 
TVWTYRELELRAGRMARTLA 
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14 881 GCACGCGGCGCGGGCCCCGGCACGCTGGTGGCGGTACGCCTGCCGCGCGGTCCCGAACCG 14 94 0 
ARGAGPGTLVAV RL PRG PE P 



14 941 GTCGCCGCGCTCCTCGCGGTCGTGCTGACGGGAGCGGGCTACGTGCCGCTCGCCGACGAC 15000 
VAALLAVVLTGAGYVPLADD 

15001 GAC CCGCCGGAC CGGTGCCGG C AC ATCCTCG ACGACTG CGC CG CCG CGCTG CTGCTGGC C 15060 
DPPDRCRHI LDDCAAALLLA 

15061 GAGCACCCCTCGCGGGACGGACGCACCCTCACCCCGGACGAGGCGCTGGCACCCGCCCGC 15120 
EHPSRDGRTLTPDEALAPAR 



15121 CCGTTCGACGCGGCCCCGGTGCGGGCCGGCGACCCGGCGTACGTGATCTACACCTCCGGC 15180 
PFDAAPVRAGDPAYVI YTSG 



15181 TCCAGTGGCCGTCCGAAGGGCGTGCTGGTCGAACAGGGCGCGCTCGGCGCCTACCTGGCA 1524 0 
SSGRPKGVLVEQGALGAYLA" 

15241 CAGGCCCGCGCGCGCTACGACGGGCTGTCCGGACGGACGGTGCTGCACTCCTCGCTGTCC 153 0 0 
QARARYDGLSGRTVLHSSLS 



153 01 TTCGACATGGCCGTGACCAGTCTGTGGGGCCCGCTCGTCAGCGGCGGCGCGATCCACGTG 153 6 0 
FDMAVTSLWGPLVSGGAIHV 

153 61 CTCGACCTGAAGGCGATCGCCTCCGGCACCCAGCCGCCGCCCGCCGCCTCGGCACGTCCG 1542 0 

LDLKAIASGTQPPPAASARP 

154 21 TCCTTCCTCAAGGTCACTCCGTCCCACCTGCCGCTGCTGGGCCTGCTGCCGGACTCCTGC 154 8 0 

SFLKVTPSHLPLLGLLPDSC 

154 81 CTGCCCACCGGGCAACTCGTGATCGGCGGCGAGGCGCTGACCGGCTCCGCGCTCGGACCC 1554 0 
LPTGQLVIGGEALTGSALGP 

15541 TGGCGCGCCGCGCACCCCGACGTCACGGTCGTCAACGAGTACGGGCCCACCGAGGCGACC 156 0 0 
WRAAHPDVTVVNEYGPTEAT 

15601 GTCGGCTGCTGCGCGTACACCGTCCGCCCCGGTGACGCCGTGGACCCGGGTGCCGTCCCC 15660 
VGCCAYTVRPGDAVDPGAVP 

15661 ATCGGACGG C CGTTCG CGGGCACCCG C CTGTACGTG CTCGACG CGG ACGG CGAGC CGGTC 15720 
IGRPFAGTRLYVLDADGEPV 

157 21 G CCGTGGG CGGTGTGGGTG AACTG C AC AT CGCGGG CGACCAGTTGG CG CG CGG ATACCTG 15780 
AVGGVGELHIAGDQLARGYL 

15781 GGGCG CC CGCGG CTG AC CG AGG AACG CTTCGT C CCGGAC CCGTT CG CCGC CGACGG CT C C 1584 0 
GRPRLTEERFVPDPFAADGS 



15841 CGGATGTACCGCACCGGCGACCTGGTGCGCGAACGCCCGGACGGCGACCTGGAGTACCTC 15 900 
RMYRTGDLVRERPDGDLEYL 



15901 GGGCG CGCGG ACGGG CAGGTG AAGGTCTC CGGGT AC CGG ATCG AG C C CGGCGAG ATCG AG 1596 0 
GRADGQVKVSGYRIEPGEIE 



15 961 GCCGTGCTCCGCGGCCACGCGGGGGTGAGGGACTGCGCGGTCGTCGCCGTCGGCGAGGCG 16 02 0 
AVLRGHAGVRDCAVVAVGEA 

16021 GACGCCCGCCGGCTCGTCGCCTACGTGGTACCGGACCCGGACTCCCCGCCCGGCACCGCC 16080 
DARRLVAYVVPDPDS PPGTA 



16081 GCGCCGGCGCGGCACG CGG CCGAGGCGCTG C CGC CGT ACATGGTGCCGG CG ACGTTCGT C 1614 0 
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APARHAAEALPPYMVPATFV 



16141 ACCGTGCCCGAACTGCCGCTCACCCCCAACGGGAAGCTCGACCGGGACGCGCTGCCCGGC 162 00 
TVPEL PLTPNGKLDRDALPG 

16201 C C C CCTGC CGG CG ACGC CGGG CCGGGCGAC CGCAC CC CGGC CG AG AC C CTG CTGTGCG AG 16260 
PPAGDAGPGDRTPAETLLCE 

16261 CTGCTGGCACGGGCCCTGGGCATCCCGGAGATCGACGCCGACGCCGACTTCCTGACGTCC 1632 0 
LLARALGI PEIDADADFLTS 

16321 GGCGGCACCAGCATCACCGCGCTGAAGCTGGTCGCCGGCGCCCGCCGGGTCGGCATCCGC 1638 0 
GGTS ITALKLVAGARRVGI R 

16381 CTCGAACTCACCACCGTC CTG CGCGAACGCACGGTGCGCCGCATCCTGG CGGC CCAGCCC 1644 0 
LELTTVLRERTVRRI LAAQP 

164 41 GACGCCGC CTCG CCC CTCG C CG AAGG AGTG C C CGAGTG AC CGGTTCCGT AACG CTC ACCC 16500 

MTGSVTLTP 
DAAS PLAEGVPE * (orf39) 

16501 CCCTCGGCGGGATCATCCCCAGGCCCCGCGGCGAGGGGCTCACCACCGGCGCCGAGTACG 1656 0 
LGGI I PRPRGEGLTTGAEYD 

16561 ACCTGGGGCCGCTCGGCGACGCGGGCCCCGACTGgGTGCGGGCCCACGGCCCGCGACTGC 1662 0 
LGPLGDAGPDWVRAHGPRLR 

16 621 GCGAGCGCCTCGCCACCGACGGGCTGATCCTGCTGCACGGTCTGCCCACCGACGGAGACG 16680 
ERLATDGL I LLHGLPTDGDG 

16681 GCGTCGACGGCTTCCACGACGTCGTCGGCTCCGTCGGCGGCGACCCGCTGCCCTACACCG 1674 0 
VDGFHDVVGSVGGDPLPYTE 

16741 AGCGCTCCACCCCGCGCAGCGTGGTCAAGGGCAACATCTACACCTCGACCGAGTACCCGG 16 80 0 
RSTPRSVVKGNIYTSTEYPA 

168 01 CCGACCAGCCCATCCCGATGCACAACGAGAACTCCTACGCCGCCCATTGGCCGTCCACGC 16 860 
DQPI PMHNENSYAAHWPS TL 

168 61 TCTACTTCTTCTGCCACACCGCGCCGGACACCGGCGGGGCCACGCCGATCGCCGACGGCC 1692 0 
YFFCHTAPDTGGATP IADGR 

16921 GCGCCGTCCTCGACCTCATCCCGGCCGAGGTCAGGCGGCGGTTCTCCCAAGGGGTCGTCT 1698 0 
AVLD L I PAEVRRRFSQGVVY 

16981 ACACCCGTACGTTCCGCGCCGACATGGGACTGAGCTGGCAGGAAGCGTTCCAGACCGAGG 1704 0 
TRTFRADMGLSWQEAFQTED 

17041 AC CG CGG CGACGTCG AACGCC ATTGCCGCG C C C ACGGC C AGG AGTTCTC CTGGG ACGGCG 17100 
RGDVERHCRAHGQEFSWDGD 

17101 ACGTCCTGCGCACCCGCCACCACCGCCCGGCGACCGCCGTCGACCCCGGCACCGGAGCCG 17160 
VLRTRHHRPATAVDPGTGAE 

17161 AGGTGTGGTTCAACCAGGCGCACCTGTTCCACCCGTCCAGCCTGGATCCCGACCTGCGCC 17220 
VWFNQAHLFHPSSLDPDLRQ 

17221 AGGTGCTCCTGGAGACGTACGGCGAGAACGGCCTGCCCCGCGACGCCCTGTTCGCCGACG 172 80 
VLLETYGENGLPRDALFADG 

17281 GCACCCCGATCCCCGACGCCGACCTGGCAACGGTCCGCGCGGCCTACACCCGCGCCGCGC 1734 0 
TPI PDADLATVRAAYTRAAL 
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17341 TCGCGCTGCCGTGGCGAGAGGGCGACATCATGCTGGTCGACAACCTGAGGATGGCCCACG 174 00 
ALPWREGDIMLVDNLRMAHG 

17401 GCCGCGAGCCCTTCACCGGCGAGCGCCGCGTACTCGTCGCGATGACCTCGGCGGACtCAT 17460 
REPFTGERRVLVAMTSADS * 

174 61 GAGCCGTGCCGACGCATCGGCACGCCGTCCTCCCGTCGGGGCGCTACCATCGCCGCTGTC 1752 0 

17521 TCGGCCATCACCCCACCCGGGCGGAGGCAACCGGCCGTGCACATCCCCGCCGTGGTCGCC 17 58 0 

17581 ACGGCACGCGCGATCACCCGCGCCATGACCGCCCAGCCCGTTGTCACATCTGCGGAGGCG 1764 0 

17641 C CGCG ATGACAG AGGT CCGAGGTG AACTG ATCCGGG CG CTGCCGGGTGTG CTGG AGGCGC 17700 
MTEVRGELIRALPGVLEAR 
(orf40) 

177 01 GTGCGGCGCGGGCGGGGCACACGACCGCCTTCCTCGACGCACGACGGTGTGTCACGTACC 1776 0 
AARAGHTTAFLDARRCVTYR 

17761 GGGAGTTGGAGGCGCGCACCCGCCGGCTGGCGGGGTCACCTGGTGCGGTTGGGGGTGCGC 1782 0 
ELEARTRRLAGS PGAVGGAQ 

17821 AGGGGCAGACCGGGTGGCGCTCGTCAATGGGCAACCGGGGTGGAGATGGCGGAGGGTTCC 1788 0 
GQTGWRS SMGNRGGDGGGFP 

17881 CTCCCCGGTGCTGCGGGCCGGAGCGGTAGGGGTGCCGCTCGATTCCGGGGCCACGGACGC 1794 0 
PRCCGPER*GCRSI PGPRTR 

17941 GGAGCTCGCGTACTTCCTCGACGACTGTGGAGCGGTGGCGGTGGTCACCGAGGAGACGCT 18000 
SSRTSSTTVERWRWS PRRRC 

18 001 GCTGCCGCGGGTCTCGCGATCGGCGGGCGTACGGATCCTGGTGGGGGGTTCGGACGCCGT 1806 0 
CRGSRDRRAYGSWWGVRTPS 

18061 CCCGGAGGGAGCGGCTGCCGGCATCCACTCCTTCGAGCGGCTCGCGGCGTCGGATCCGGG 1812 0 
RRERLPASTPSSGSRRRIRG 

18121 GTGCGCGCCACGGGACGACCTCGGCCTCGACGAGCCGGCCTGGATCCTCTACACGTCGGG 1818 0 
ARHGTTSASTSRPGSSTRRG 

18181 GACCACGGGCCGGAGCAAGGGCGTGGTCTGCGGCCAGCGCGCCGCGCTGTGGTCCGTGGC 1824 0 
PRAGARAWSAASAPRCGPWR 

182 41 GGCGGCGTACGTGCCGTCGTGGGGTCTGGGGCCGCAGGACCGGCTGTTGTGGCCGCTGCC 183 00 

RRTCRRGVWGRRTGCCGRCP 

183 01 CATGTTCCACGCCTACGCGCACTCGCTGTGCCTGCTCGGGGTGGTGGCCGTGGGCGCGAG 183 60 

CSTPTRTRCACSGWWPWARA 

183 61 CGCGTACCTCCTCGACCGGGGCGCGAGCGTCGTCCGGGCGCTTGAGGAACAGCGGTGCAG 184 2 0 

RTSSTGARASSGRLRNSGAA 

184 21 CGTCGTGGCCGGTGTACCCGCCACCTACCGCCTGCTCACGAGCGCCTTCCGCGACGCCCC 184 8 0 

SWPVYPPPTACSRAPSATPP 

184 81 CCGGCCACCGGCCGGCCTGCGACTGTGCGTCACCGGGGGCTGCGCCGTGCCCGCCGGGGC 1854 0 
GHRPACDCASPGAAPCPPGL 

18541 TGCGGG CGGACGTTGAGG AGCTG CTGGGCGTC CCGCTG CTCGACGGTT ACGG CAGT AC CG 18600 
RADVEELLGVPLLDGYGSTE 
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18601 AGACCTGCGGCAAGATCACGG* 
T C G K I T V 



SEQ ID NO: 3 BLM gene PPTase ORFS 41 



i 

81 

161 

241 
1 

321 
27 

401 

53 

481 
80 

561 
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160 

801 
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881 
213 

961 
240 

1041 

1121 

1201 

1281 

1361 

1441 

1521 

1601 

1681 

1761 




IAGCGGCTCGGCGGCTCCCGGGAGGGCGGTTGCCGGG 
RLGGSREGGCR 



18660 



GGATCCTGCGCTACCCGGACTTCGCtCAGTGGTGCGGCACCGAGCTCACCGCCGACTGGCACGTCCGCTTCCGGGCCGCC 8 0 

GCCGCGGTCTACGGGCATCTGCACA'icCCCCGCGTGACCCGGTACGACGGCGTCCGCTTCGAGGAGGTGTCGGTCGGCTA 16 0 

CCCGCGCGAGTGGCGGCCCCGGCCGCCCCGCGAGCCGCTCCGGCAGATCCTGCCCCAGCCCGTCGACGAGCCGGGAGCCC 24 0 

TCTGGTGATCGCCGCCCTCCTGCCCTGCTGGGCCGTCACCGAACACGCCTTCACCGACGCCCCGGACGACCCGGTGAGCC 320 

MIAALLPS \W AVTEHAFTDAPDDPVSL 26 

TCCTCTTCCCCGAGGAGGCCGCCCACG'nCGCCCGCGCCGTCCCCAAGCGCCTGCACGAGTTCGCCACCGTCCGGGTGTGC 4 00 

LFPEEAAHVAARAVPKRLHEFATVRVC 52 

GCCCGCGCCGCCCTCGGCCGGCTGGGCcicCCGCCCGGTCCGCTGCTGCCCGGCCGACGGGGCGCGCCGAGCTGGCCGGA 4 8 0 

ARAALGRLGLlPPGPLLPGRRGAPSWPD 79 

CGGGGTGGTGGGGAGCATGACGCACTGTCAGGGCTTCCGGGGCGCCGCGGTCGCCCGGGCCGCCGACGCCGCGTCGCTCG 560 

GVVGSMTHCQ\GFRGAAVARAADAASLG 106 

GGATAGACGCCGAGCCGAACGGGCCGCTCC tGGACGGCGTCCTCGCCATGGTCTCGCTGCCGTCCGAGCGCGAGTGGCTC 64 0 

IDAEPNGPLP\DGVLAMVSLPS EREWL 132 

GCCGGACTGGCGGCCCGCCGGCCGGACGTGckcTGGGACCGGCTGCTGTTCAGCGCCAAGGAGAGCGTCTTCAAGGCGTG 720 

AGLAARRPDVH\wDRLLFSAKESVFKAW 159 

GTACCCGCTGACCGGCCTGGAGCTGGACTTCGACGAGGCCGAGCTGGCCGTCGATCCGGACGCCGGGACGTTCACGGCCC 8 00 

YPLTGLELDFDVeAELAVDPDAGTFTAR 186 

GGCTGCTGGTGCCGGGACCGGTGGTCGGCGGCCGTCGGCTGGACGGGTTCGAGGGGCGCTGGGCGGCGGGCGAGGGCCTC 880 

LLVPGPVVGGRlRLDGFEGRWAAGEGL 212 

GTCGTCACGGCCATCGCCGTCGCGGCGCCGGCCGGTACCGCGGAGGAATCGGCGGAAGGGGCCGGGAAGGAAGCGACTGC 960 

VVTAIAVAAPAG\TAEESAEGAGKEATA 239 

GGACGACCGGACCGCCGTCCCGTAAACCGCCCCGAUcACCGGCGTGGCGCCCGCCGACCGTGTCGGGGGCGCCACGAACG 104 0 

DDRTAVP* I 247 

GGCGCCGGCCCGGCGGGCCCTCCGCCGTGCGGAGCGGAGGCCCGGCGCGGACGCGCCCGGTGTCGTCGGATACGTGCGTC 1120 

AGTCGGCG ACG CAG ACGTTGCCGTTGGTCG AGTTGA® CAG C C CG ACG ATGTCG ATGGTGTTG CCG CAG AGGTTG ATGGGG 12 00 

ATGTGGACGGGGATCTGGATGACGTTGCCCGAGACGAlCGCCCGGGGAGCCGACGGCCGCCCCCTTGGCGTTCGAGTCGGC 12 80 

GAGGGCGGTGCCGGAGACGCCGGCGAGCGCCGTGCCCACGGTGGCGGTGAGGGCCGCTGCCTTGGCGATTCGTGACATGG 13 60 

GGTGACACCTTCGTTCGGTCTGACAGGGTCGAGCTCAqpGCCTCTGACGGCCGGGAGCCCGGATCAACGCCCGATCACCC 144 0 

CGAAGGTTTCGAATCGTGCGGCGGACGGGTGACCGGCGGCCGAACGGCCTCGCCGGGCCCCCGGAAGGTGCCATGACGTC 152 0 

CGTGCG CCATCTGT ACAG C C CGGTC C CG CG C CG CGT ACA^GGG ACGG ACGGACGG CCGGTGG ACGG ACGAC CGG CGGGG A 1600 

GGGGAGGCCATGAGCCGGATCGCGATCGTCGGGGCGGGTCAGGCCGGACTGCATCTGGCGCTGGGGCTGCTGGGGGCGGG 168 0 

GAGCGGCTCTTCCCGTCACGAGGTGCTGCTCGTGTCCGACGGGACGCCGGACGAGATCCGCGCCGGGCGGGTGCGGTCGA 1760 



1761 



56 



